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Protect India’s 
universities 


The government and state authorities must 
step in and stop violent attacks on campuses. 


or several weeks the world has watched as India’s 

citizens — including academics and students — 

have taken tothe streets. Tens of thousands have 

been gathering to read out the preamble to the 

Indian constitution, as a mark of protest against 
a discriminatory new citizenship law. 

The law provides a path to citizenship for recent refugees 
from Afghanistan, Bangladesh and Pakistan. It is a means 
to providing permanent sanctuary for religious minorities 
fleeing hardship or persecution in these countries — an 
intention that is to be commended. Whatis troubling is that 
decisions on who can — and cannot — apply for citizenship 
will be made on the basis of religious belief. Muslims are to 
be excluded, whichis a violation of the foundational prin- 
ciple that people of all faiths and none must be equal inlaw. 

Tragically, some of the peaceful protests are being met 
with violence, and university campuses are not immune. 
The latest high-profile incident took place at Jawaharlal 
Nehru University (JNU) in New Delhi, where students have 
also been protesting over an increase in accommodation 
fees. On the evening of 5 January, people wearing masks 
and carrying iron rods, stones and wooden clubs entered 
the campus and launched anattack. The city’s police failed 
to provide protection, according to the international advo- 
cacy organization Human Rights Watch. 

Videos of bloodied and bruised students and staff have 
been widely shared. Surya Prakash, a student of Sanskrit 
texts who is visually impaired, was beaten in his room. And 
Sucharita Sen, a researcher at JNU’s Centre for the Study 
of Regional Development, confirmed to Nature that she 
was hit on the head “witha stone the size of half a brick”. 

In mid-December, police entered two of India’s older 
universities — Jamia Millia Islamia in New Delhi and 
Aligarh Muslim University in the neighbouring state of 
Uttar Pradesh. Students were beaten, property was dam- 
aged and tear gas used. Both institutions had to close tem- 
porarily, disrupting teaching and research. Jamia Millia’s 
vice-chancellor, Najma Akhtar, said that it is not acceptable 
for police to harm innocent students. 

The severity of the police action has rightly prompted 
achorus of international concern. Among those speaking 
up are the Nobel prizewinners Abhijit Banerjee, an econo- 
mist and JNU alumnus nowat the Massachusetts Institute 
of Technology in Cambridge, and Venki Ramakrishnan, a 
biologist and the president of the Royal Society in London, 
who received his undergraduate education in India, and 
whois also critical of the new law. 

Many of the government’s supporters are upset that 
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university students, academics and scientists are also 
opposing the new law. But they must know that freedom 
of expression is core toa university’s mission; that the abil- 
ity of citizens to protest peacefully against government 
policies is a right, nota privilege; and that the state should 
provide protection for such dissent. Without it, no oppo- 
sition would be able to present its case to the public — as 
members of the current government and its supporters 
did inthe years they were out of power. 

Academics in India and around the world are right to be 
alarmed and to speak up, because force has been used on 
university campuses, causing fear. India’s authorities must 
take the necessary steps to protect their nation’s universi- 
ties and their people’s freedom of speech. 

They must heed the words of the prime minister’s 
principal scientific adviser, Krishnaswamy VijayRaghavan, 
who said unequivocally: “Campuses are places for learning, 
discussion, collegial debate amongst diverse opinions, and 
research. There is no place at all for violence.” 


Solve hunger with 
systems thinking 


Feeding the world involves tackling all aspects 
of the food system. 


hat would the world be if there were 
no hunger?” It’s a question that the 
late ecologist Donella Meadows 
would ask her students at Dart- 
mouth College in Hanover, New 
Hampshire, back inthe 1970s. They found it hard to answer, 
she later wrote, because imagining something that isn’t 
part of real life — and learning how to make it real — is a 
rare skill. Itis taught to artists, writers and engineers, but 
much less often to scientists. Meadows set out to change 
that, and helped to create a global movement. The result 
— an approach known as systems thinking — is now seen 
as essential in meeting big global challenges such as the 
Sustainable Development Goals. 

Systems thinking is crucial to achieving targets such 
as zero hunger and better nutrition because it requires 
considering the way in which food is produced, processed, 
delivered and consumed, and looking at how those things 
intersect with human health, the environment, econom- 
ics and society. Doing this is genuinely difficult, but it’s 
not impossible if the barriers are known. Some of these 
obstacles — along with potential solutions — are explored 
this week in a series of articles in the first issue of Nature 
Food, one of three journals in the Nature Research portfo- 
lio (along with Nature Cancer and Nature Reviews Earth & 
Environment) to launch this month. 

According to systems thinking, changing the food 
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system — or any other network — requires three things to 
happen. First, researchers need to identify all the players 
inthat system; second, they must work out how they relate 
to each other; and third, they need to understand and 
quantify the impact of those relationships on each other 
and on those outside the system. 

Take nutrition. Inits latest report on global food security, 
the United Nations Food and Agriculture Organization says 
that the number of undernourished people in the world 
has been rising since 2015, despite great advances in nutri- 
tion science. For example, tracking of 150 biochemicals 
in food by the US Department of Agriculture and various 
databases has been important in revealing the relationships 
between calories, sugar, fat, vitamins and the occurrence of 
common diseases. But using machine learning and artificial 
intelligence, network scientist Albert Laszl6 Barabasi at 
Northeastern University in Boston, Massachusetts, and 
his colleagues propose that human diets consist of at least 
26,000 biochemicals — and that the vast majority are not 
known (Nature Food 1, 33-37; 2020). This shows that we 
have some way totravel before achieving the first objective 
of systems thinking — which, in this example, is to identify 
more components of the nutrition system. 

Asystems approach to creating changeis also built onthe 
assumption that everyone in the system has equal power 
and status — or agency, to use the academic term. But as 
health-equity researcher Sharon Friel at the Australian 
National University in Canberra and her colleagues show, 
the food system is not an equal one, and the power of 
world trade can override environmental and nutritional 
needs (S. Friel etal. Nature Food 1, 51-58; 2020). Countries 
need to pass relevant laws and regulations to meet global 
goals for nutrition and climate change. But this becomes 
difficult because the global trade rules set by the World 
Trade Organization (WTO) are legally binding on coun- 
tries, whereas policies on climate change or nutrition are 
often not. 

The need for a global counterweight to the WTO has 
led to calls for a World Environmental Organization (see, 
for example, go.nature.com/2th18yc). Another way to 
redress such power imbalances is for more universities 
to do what Meadows did and teach students how to think 
using a systems approach. 

A team of researchers has done just that, through the 
Interdisciplinary Food Systems Teaching and Learning 
programme (J. Ingram et al. Nature Food 1, 9-10; 2020). 
Students from disciplines including agriculture, ecology 
and economics learn together by drawing on their collective 
expertise in tackling real-world problems, such as how to 
reduce food waste. Since its launch in 2015, the programme 
has trained more than 1,500 students from 45 university 
departments. 

More researchers, policymakers and representatives 
from the food industry must learn to look beyond their 
direct lines of responsibility and embrace a systems 
approach, as the editors of Nature Food advocate in their 
launch editorial (Nature Food 1, 1;2020). Meadows knew 
that visions alone don’t produce results, but concluded 
that “we'll never produce results that we can’t envision”. 
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The life of archaea 


Cultivation of Asgard archaea brings us closer 
to understanding how complex life evolved. 


ilaire Belloc’s ‘The Microbe’ opens with the 
words: 


The microbe is so very small, 
You cannot make him out at all. 


The poem lists the wonders of microorganisms, and they 
continue to reveal their secrets to researchers morethana 
century after his book The Bad Child's Book of Beasts (1896) 
excited and delighted children. 

In 2015, researchers published the metagenome of a 
member of the Asgard group of archaea called Lokiar- 
chaeota (A. Spang et al. Nature 521, 173-179; 2015). These 
are descended from an ancient lineage of archaea, simple 
cells lacking a nucleus and distinct from bacteria. This 
discovery was exciting because the genes were found to 
have similarities with those of eukaryotes — the group of 
organisms whose cells have nuclei and other structures, 
and which include plants, fungi, humans and other animals. 
That suggested a stronger connection between archaea 
and eukaryotes than had previously been thought. 

Now, after a heroic effort that took 12 years, researchers 
led by Hiroyukilmachi, a microbiologist at the Japan Agency 
for Marine-Earth Science and Technology, Yokosuka, have 
successfully grown a new Asgard lineage (H. Imachi et al. 
Nature https://doi.org/10.1038/s41586-019-1916-6; 2020). 
This achievement puts to rest concerns that the genes 
sequenced in 2015 were the result of contamination, or 
the initial sample being a mix of cells. 

Imachiand his colleagues grew cells from sediment that 
had been collected below the sea bed. But why did the cells 
take so long to grow? The problem in culturing cells from 
sediment is that most microbes aren't as obliging as famil- 
iar lab workhorses such as Escherichia coli. The research- 
ers took up the challenge and with much patience, trial 
and error, they found that the cells grew best on a diet of 
peptides, amino acids and even baby-milk powder. 

The resulting cells are tiny spheres 300-750 nanometres 
in diameter, but they often extrude longer, branched fila- 
ments that reach out to meet neighbouring bacteria. The 
researchers think that sucha partnership, both biochem- 
ical and physical, could tell us more about the processes 
that led to the eukaryote cell being formed — a question 
more researchers must surely try to tackle. 

Despite the promise of what is to come, a degree of cau- 
tion is needed. Eukaryotes evolved more than two billion 
years ago, possibly coincident with an episode of global 
climatic change called the Great Oxidation Event. None- 
theless, the achievement brings us closer to meeting living 
relatives of our ancestors. We await the next chapter with 
anticipation. 
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CLIVE SHERLOCK 


A personal take on science and society 


World view 


By Hetan Shah 


Global problems 
need social science 


Without human insights, data and the hard 
sciences will not meet the challenges of the 
next decade, says Hetan Shah. 


t the beginning of the year, Dominic 

Cummings, a senior adviser to the UK govern- 

ment, posted an unusual advert on his blog, 

calling for data scientists, mathematicians and 

physicists to join him at the heart of govern- 
ment. As outgoing director of the Royal Statistical Society 
in London and soon-to-be chief executive of the British 
Academy, I support the sentiment behind the call: data do 
have huge power to inform government policy. 

ButI worry about the fact that the call prioritized science 
and technology over the humanities and social sciences. 
Governments must make sure they also tap into that exper- 
tise, or they will fail to tackle the challenges of this decade. 

For example, we cannot improve global health if we 
take only a narrow medical view. Epidemics are social as 
well as biological phenomena. Anthropologists such as 
Melissa Leach at the University of Sussex in Brighton, UK, 
played animportant part in curbing the West African Ebola 
epidemic with proposals to substitute risky burial rituals 
with safer ones, rather than trying to eliminate such rituals 
altogether. 

Treatments for mental health have made insufficient 
progress. Advances will depend, in part, ona better under- 
standing of how social context influences whether treat- 
ment succeeds. Similar arguments apply to the problem 
of antimicrobial resistance and antibiotic overuse. 

Environmental issues are not just technical challenges 
that can be solved with a new invention. To tackle climate 
change we will need insight from psychology and sociology. 
Scientific and technological innovations are necessary, 
but enabling them to make an impact requires an under- 
standing of how people adapt and change their behaviour. 
That will probably require new narratives — the purview of 
rhetoric, literature, philosophy and even theology. 

Poverty and inequality call even more obviously for 
expertise beyond science and maths. The UK Economic and 
Social Research Council has recognized that poor produc- 
tivity inthe country is a big problem, and is investing up to 
£32.4 million (US$42 million) ina new Productivity Institute 
inan effort understand the causes and potential remedies. 

Policy that touches on national and geographical 
identity also needs scholarly input. What is the rise of 
‘Englishness’? How do we live together in a community 
of diverse races and religions? How is migration under- 
stood and experienced? These intangibles have real-world 
consequences, as demonstrated by the Brexit vote and 
ongoing discussions about whether the United Kingdom 
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has a future as a united kingdom. It will take the work of 
historians, social psychologists and political scientists 
to help shed light on these questions. I could go on: fight- 
ing against misinformation; devising ethical frameworks 
for artificial intelligence. These are issues that cannot be 
tackled with better science alone. 

Consider how life-enhancing — and even life-saving — 
technologies have failed to be taken up. ‘Vaccine hesitancy’ 
is morea social phenomenon than atechnical one, and the 
main cause of measles resurgence. Solutions depend not on 
medical breakthroughs, but on insight from anthropolo- 
gists such as Heidi Larson at the London School of Hygiene 
& Tropical Medicine, who have done much to understand 
people’s decisions about whether to vaccinate themselves 
and their children. 

In diverse cases, social factors — cultural norms, edu- 
cational understanding, kin and social networks, power 
dynamics, or simply the layout of a building — must be 
accounted for before policy can succeed. Blind faith in 
data science without an understanding of what data are 
missing, or how algorithms can exacerbate existing biases, 
can lead to policy failure. 

A good example of the incorporation of appropriate 
expertise is the UK government’s Behavioural Insights 
Team, which has run more than 750 projects around the 
world, in particular randomized control trials of policy 
interventions. Work on the treatment of tuberculosis in 
Moldova brought the rate of adherence to daily medical 
regimes up from 44% to 84%. 

Downing Street is right to look for data scientists, but 
must not overlook the benefits brought by the humanities 
and social science. This expertise is embedded in existing 
staff and structures and should not be overlooked. There 
are many successes, from the creation of the ‘nudge units’ 
to the use of social-science expert panels. Detailed policy 
histories — compiled by agencies suchas the British Acad- 
emy and the UK Institute for Government — can provide 
surprising and valuable insights. 

More could be done to connect the policy community 
with external social science and humanities expertise. 
Chile’s innovation-focused Government Laboratory, the 
Bridging the Gap programme in Washington DC, and the 
Centre for Science and Policy in Cambridge, UK, have all 
used a variety of mechanisms — among them workshops, 
funding schemes and policy fellowships — to bring expert 
voices into the policymaking process. 

Ina democracy, expert advice must be balanced with 
considerations such as public opinion, financial costs 
and political demands. Still, without the humanities and 
social sciences, hard science and technology can do little 
to resolve complex societal challenges. Wise governments 
will find ways to incorporate that insight. 
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The world this week 


Newsin brief 


PREDATORY-JOURNAL PAPERS 
HAVE LITTLE SCIENTIFIC IMPACT 


Papers published in ‘predatory’ 
journals attract little attention 
from scientists, and get cited 
much less than those in reputable 
publications, an analysis shows. 

Predatory journals charge 
authors high article-processing 
fees, but don’t provide expected 
publishing services, such as peer 
review. Researchers have long 
voiced fears that these practices 
could be harming research by 
flooding the literature with 
poor-quality studies. 

But the authors of the 
21 December analysis say their 
findings suggest papers in 
predatory journals have a “very 
limited readership among 
academics” (B.-C. Bjork etal. 
Preprint at https://arxiv.org/ 
abs/1912.10228; 2019). 

The researchers picked 
250 predatory journals from 
more than 10,000 titles ona list 
of such publications curated by 
Cabells, a publishing analytics 
company in Beaumont, Texas. 
They then selected one paper 
published in 2014 from each 
of the 250 journals. Using the 
Google Scholar search engine, 
they manually checked how many 
times each paper had been cited 
since its publication. 


Around 60% of the papers 
hadn’t attracted any citations 
at all, and 38% were cited up to 
10 times. Less than 3% of the 
papers attracted more than 
10 citations, and none got more 
than 32 citations (see ‘Limited 
readership’). 

The lack of citations 
could indicate that the harm 
predatory-journal articles cause 
might have been exaggerated, 
says Bo-Christer Bjork, an 
information-systems scientist at 
the Hanken School of Economics 
in Helsinki who co-authored 
the study. “If people don't cite, 
they probably don’t read those 
articles,” he says. 

The results aren't surprising 
— not many academics thought 
predatory-journal papers 
were highly cited, says Matt 
Hodgkinson, head of research 
integrity at the open-access 
publisher Hindawi in London. But 
he argues that predatory journals 
still pose a threat to science and 
to scholarly publishing in several 
ways. They trick researchers and 
institutions out of payments, 
refuse to reject flawed papers 
and tarnish the reputation of 
legitimate open-access journals, 
he says. 


LIMITED READERSHIP 


Papers published in predatory journals five years ago have attracted few or no citations. 


22-32 citations 1.6% 
11-21 citations 1.2% ———————__ 


2-10 citations 24.8% 


1 citation 12.8% 


SAMPLE SIZE 


250 papers 


No citations 
59.6% 
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ANIMAL-CLONING 
SCIENTIST GETS 
PRISON SENTENCE 


Leading animal-cloning 
researcher Li Ning has been 
sentenced to 12 years in prisonin 
China for allegedly embezzling 
research funding. 

Li’s team famously engineered 
cows to produce milk containing 
a human milk protein (B. Yang 
etal. PLoS ONE 6, e17593; 2011). 

On 3 January, acourt in Jilin 
Province found that Li, formerly 
a researcher at the China 
Agricultural University in Beijing 
and a member of the Chinese 
Academy of Engineering (CAE), 
had stolen 34.1 million yuan 
(US$4.9 million) in research 
grants, and invested the money 
in his own companies, according 
to Xinhua, China’s state news 
agency. His former assistant, 
Zhang Lei, received a sentence 
of more than 5 years for 
allegedly helping. 

Zhang admitted to the 
charges, according to Xinhua. 
But Li denied stealing the 
money, and said that he had 
invested unused grant funding 
with the intention of using it 
for future research, according 
to the Chinese newspaper 
Economic Observer. 

Li’s lawyer did not respond to 
a request for comment. Some 
Chinese media reported that Li 
is likely to appeal. 

In 2018, 15 members of the 
CAE and the Chinese Academy 
of Sciences urged the president 
of China’s supreme court to 
finalize Li’s case and praised his 
research achievements. 
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EARTH-SIZED 
EXOPLANET SPIEDIN 
‘HABITABLE ZONE’ 


Astronomers have discovered a 
world only alittle bit bigger than 
Earth, whirling around a bright 
star about 31 parsecs from our 
planet. The world, known as 
TOI 700 d, orbits inits star’s 
‘habitable zone’ — the regionin 
which liquid water could exist. 
Astronomers know of only a 
handful of such worlds. 

“We don’t have that many 
Earth-sized planets in the 
habitable zone,” says Elisa 
Quintana, an astronomer at 
NASA’s Goddard Space Flight 
Center in Greenbelt, Maryland. 
“Having one around a nearby 
bright star is exciting,” she adds, 
because it is easier to study 
planets around nearby stars 
than around distant ones. 

Emily Gilbert, an astronomer 
at the University of Chicago 
in Illinois, and her colleagues 
discovered the planet (depicted 
below) using NASA’s Transiting 
Exoplanet Survey Satellite 
(TESS). It is the first Earth-sized 
planet discovered by TESS that 
lies in its star’s habitable zone. 
Gilbert reported the discovery 
on 6 January at a meeting of the 
American Astronomical Society 
in Honolulu, Hawaii. 

TESS, which launched in 2018, 
sweeps the night sky, looking for 
stars that periodically dim as an 
orbiting planet passes in front 
of them. It has found more than 
1,500 planet candidates using 
this method. 


LTO R: SOURCE: B.-C. BJORK ET AL. PREPRINT AT HTTPS://ARXIV.ORG/ABS/1912.10228 (2019); 
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Researchers have identified a new virus as the cause of a 
respiratory illness that has affected dozens of people in 
China, one of whom has died. One case of the virus has 
also been detected outside China, in Thailand. 

The pneumonia-like illness surfaced last December, 
mostly in people who worked at or regularly visited a 
live-animal and seafood market in the city of Wuhan, 
China. 

On 9 January, Chinese state media reported that 
scientists had sequenced the genome of the culprit: a 
previously unknown member of the coronavirus family, 
which also includes the virus that causes the highly con- 
tagious severe acute respiratory syndrome (SARS) that 
killed hundreds of people in China in 2002-03. China 
has now publicly shared the virus’s genetic sequence. 

Forty-one people have been confirmed as being 
infected with the virus. The World Health Organization 
(WHO) says that the infected person in Thailand had 
travelled there from Wuhan. Authorities in Hong Kong 
and South Korea have been screening travellers (see 
picture) who have recently been to Wuhan. 

There is noclear evidence of human-to-human 
transmission, the WHO says. Scientists suspect that an 
animal is passing the virus to people. 


MIT RELEASES 
REPORT ONEPSTEIN 
DONATIONS 


Sex offender and alleged 

sex trafficker Jeffrey Epstein 
donated US$850,000 to the 
Massachusetts Institute of 
Technology (MIT) in Cambridge 
between 2002 and 2017, and 
visited the prominent US 
university at least nine times. 

Those are the findings of an 
investigation conducted by law 
firm Goodwin Procter on the 
university’s behalf. MIT released 
the report on 10 January. 

MIT president Rafael Reif did 
not know that the university was 
accepting money from Epstein 
while it was taking place, the 
report found, but three senior 
administrators drew up an 
“informal framework” in 2013 
to accept money from Epstein. 
“No Senior Team member 
violated any law, breached any 
MIT policy, or acted in pursuit of 
personal gain in connection with 
Epstein’s donations,” the report 
says. 

In 2008, Epstein pleaded 
guilty in Florida to two felony 
charges of soliciting a minor for 
prostitution and served more 
than a year in prison. He died by 
suicide in August while awaiting 
trial on federal charges of 
trafficking under-age girls. 

The bulk of Epstein’s 
donations to MIT occurred after 
his guilty plea. The report found 
that former director of the MIT 
Media Lab Joi Ito and mechanical- 
engineering professor Seth Lloyd 
were key to maintaining the 
relationship with Epstein. 

Lloyd received $225,000 in 
research funds and $60,000 asa 
personal gift. He “purposefully 
failed to inform MIT” that 
Epstein was funding his work, 
the report said. MIT has placed 
Lloyd on paid administrative 
leave. “Just heard myself and so 
can’t comment right now,” Lloyd 
said. Ito did not respond toa 
request for comment. 
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The world this week 


News in focus 


In Loving Memory 


Our family members and friends 
Who lost their lives in 
the tragic accident 
of Ukraine Airlines flight crash 


CANADIAN UNIVERSITIES 
MOURN RESEARCHERS WHO 
DIED INIRAN PLANE CRASH 


Victims include two computer scientists returning from 
their wedding in Iran and a student about to begin his PhD. 


By Nidhi Subbaraman 


anadian universities are mourning 
more than two dozen faculty mem- 
bers and students who died when 
Ukraine International Airlines flight 
PS752 crashed in Iran on 8 January. 
One hundred and seventy-six people were 
killed when the Boeing 737-800 crashed 
shortly after take-off from Tehran. At least 
57 victims were Canadian, and, according to 
the country’s prime minister, Justin Trudeau, 
138 people on the flight were en route to 
Canada. 


“What happened yesterday was a tragedy, 
a tragedy that shocked not only Canada but 
the world,” Trudeau said at anews conference 
on 9 January, calling for a “thorough investi- 
gation” into the causes of the crash. Initial 
reports from the United States and Canada 
pointed to an Iranian surface-to-air missile. 

On 11January, Iran’s military announced that 
it had unintentionally shot down the plane. 
“The Islamic Republic of Iran deeply regrets 
this disastrous mistake,” Iran’s President 
Hassan Rouhani said on Twitter. 

Across Canada, universities lowered flags to 
half mast and planned gatherings to honour 
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the dead. “I want to express my heartfelt grief. 
Words simply cannot express the loss I know 
weallare feeling,” David Turpin, president and 
vice-chancellor at the University of Albertain 
Edmonton, wrote on 8 January. About 50 peo- 
ple attended a vigil at the Alberta Legislature 
Building on 8 January, lighting candles and 
placing photos on the steps, the Canadian 
Broadcasting Corporation reported. 

University of Alberta engineers Pedram 
Mousavi and Mojgan Daneshmand and their 
two young daughters were among those who 
died. “They were big deals,” says James Hogan, 
a materials engineer at Alberta. 
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Hogan lost a graduate student, Amir 
Saeedinia, who was travelling from Iran to 
begin his PhD in Hogan’s lab. “He worked very 
hard to create this opportunity for himself. 
This was just a start for him in Canada — he was 
coming to join the group today.” 

On 9 January, the university identified seven 
other community members who were listed 
as passengers. They include Arash Pourzarabi 
and Pouneh Gorji, graduate students in com- 
puter science who were returning to Canada 
after their wedding in Iran. Turpin said that 
university flags would be lowered to half mast 
to remember the victims, and a memorial 
service would be held. 


‘Weare all heartbroken’ 


Peyman Servati, an electrical engineer at the 
University of British Columbia in Vancouver, 
said on Twitter that he was “so so devastated” 
by the deaths of Mousavi, Daneshmand and 
their daughters — his “smart and kind friends’. 

The University of Toronto announced that 
flags at its three campuses would fly at half 
mast in memory of at least six students who 
were expected to have been on the plane. 
“We are all heartbroken,” university president 
Meric Gertler said in a statement. 

At least five University of Windsor commu- 
nity members were also listed as passengers, 
the Ontario university said in a statement. 
They included Hamidreza Setareh Kokab, 
who had begun studying for a PhD in mechan- 
ical engineering last January in Jill Urbanic’s 
lab. “He would have been successful in both 
academia and industry. We lost a bright 
light,” says Urbanic. Kokab’s wife Samira 
Bashiri, a biology research assistant, also died 
on the flight. 

At Western University in London, Ontario, 
about 250 people gathered on 8 January 
to remember 4 students who were killed in 
the crash. And the University of Waterloo 
confirmed that two of its students had died. 

Flags at the University of Guelph were 
lowered to mourn for two university graduate 
students: Ghanimat Azhdariof the Department 
of Geography, Environment and Geomatics, 
and Milad Ghasemi Ariani of the Department 
of Marketing and Consumer Studies. 

“Milad had just started his PhD with us 
in the fall, so much hope, gone,” said Statia 
Elliot, amarketing academic at the University 
of Guelph, on Twitter. “My heart goes out to 
family and friends.” 

Faisal Moola, an ecologist at the University 
of Guelph, said that it had been an “awful 
48 hours for our students and faculty”. 
“Ghanimat was such a proud member of the 
Qashqai Indigenous tribe in Iran and spoke 
with such love for her people and her ances- 
tral territories,” he wrote on Facebook. “She 
was a powerful and passionate young leader 
in defence of Indigenous Peoples across the 
planet and her life’s work continues.” 
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Chinese scientists are encouraged to study abroad and then return home. 


CHINESE ACADEMICS 
WHO WORK ABROAD ARE 
SLOWER TO WIN HONOUR 


Local networks help to speed researchers to highly 
prestigious Changjiang scholarship. 


By Hepeng Jia 


oes working in a foreign country 
enhance the career of a Chinese 
scientist? For years, China has been 
encouraging researchers to study 
abroad and then bring their expertise 
home. But a study finds that returnees take 
longer than peers who remainedin Chinato win 
oneofthe country’s highest scientific honours. 

Of the roughly 1,500 Chinese nationals 
awarded a Changjiang scholarship in the 
sciences between 1999 and 2015, those who 
received a PhD from a foreign university had 
held their doctorate for 25% longer — about an 
extra 2.3 years — when they won the scholar- 
ship than those who earnt their PhD in China. 
That’s the finding of a study led by Tang Li, a 
public-policy researcher at Fudan University 
in Shanghai, China. 

The prestigious Changjiang scholarship is 
awarded by the Chinese Ministry of Education. 
It comes witha yearly stipend of 200,000 yuan 
(US$29,000) and is seen as more important 
thanasalary or even individual professorships 
in terms of conferring respect among peers 
and indicating scientific achievement. 

International mobility has been shown 
to boost scientists’ skills and networks and, 
once academics return to their homelands, to 
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increase the country’s international expertise 
and exposure to global research practices. 
Institutions need to better recognize the 
benefits of international training and reward 
it, Tang says. But the results of the study, pub- 
lished last year, suggest that the advantages 
of overseas training might not be well recog- 
nized (F. Liand L. Tang Sci. Public Policy 46, 
518-529; 2019). The researchers also report 
that local connections help academics obtain 


Theresults of thestudy 
suggest that the advantages 
of overseas training might 
not be well recognized. 


the honour faster, and that this could explain 
why returnees are at a disadvantage. 

Over the past decade, numerous national 
recruitment programmes have attracted 
Chinese-born academics back to the country 
from foreign institutions, often with promises 
of higher salaries and research funding. The 
best known, the Thousand Talents Plan, has 
come under scrutiny in the United States inthe 
past year for potentially being a threat to US 
research and intellectual property. Academics 
returning to China from leading international 
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universities are promised relocation packages 
of up to one million yuan, as well as top sala- 
ries and millions in research grants. To learn 
more about the impact of working abroad, 
Tang and her colleague Li Feng at Hohai Uni- 
versity in Nanjing, China, decided to review the 
career trajectories of 1,447 Changjiang schol- 
ars. About one-third held PhDs from overseas 
universities, and about half had short-term 
overseas experience. 

The average time between graduating with 
a PhD and becoming a Changjiang scholar is 
10.3 years, Tang and Feng report. But it took 
researchers with any type of overseas experi- 
ence longer to receive the honour than it took 
those with no international experience. Even 
Chinese scholars who left the country temporar- 
ilyto be visiting scholars at international institu- 
tions waited 12% longer (about a year extra) for 
the title than did peers who remained in China. 

The researchers controlled for factors that 
could influence the timing of the award, such 
as gender, research field, where academics 
trained overseas and the status of the academ- 
ic’s university when they received the award. 

Inthe second part of the study, the research- 
ers examined the Changjiang scholars’ 
networks. They found that academics with 
strong local connections — measured as hav- 
ing obtained a bachelor’s degree and PhD at 
the same institution in which they work — were 
faster to obtain the title. Scholars working at 
their alma mater received the honour, on aver- 
age, about 2 to 8 months faster than did those 
working at a different university. 

Tang suggests that because award recipients 
are nominated by their university, researchers 
who have spent their academic career in China 
might be more likely to be nominated — and 
more familiar to the reviewers — than are aca- 
demics who have been out of China for years. 

Dong Jielin, a guest researcher in science 
policy at Tsinghua University in Beijing, says 
that the results support the belief among 
some that China has an unhealthy academic 
environment that rewards personal connec- 
tions over skills and experience. Efforts to 
reform this culture are needed, she says. 

But Cao Cong, a science-policy researcher 
at the University of Nottingham Ningbo China, 
has another theory. He suggests that in the 
past, researchers with overseas training might 
have been slower to obtain the award because 
they were not as academically competitive 
as the best locally trained scientists. Before 
programmes suchas Thousand Talents, many 
overseas-trained PhD graduates preferred 
to apply for foreign faculty positions, and 
so academics who returned to China might 
have done so because they couldn’t secure an 
overseas position, he says. 

Cao suggests that a future study should 
examine overseas-trained scientists who have 
returned more recently, to see whether the 
study’s conclusion still holds. 


SPACEX TESTS BLACK 
SATELLITE TO REDUCE 


REFLECTIVITY 


‘DarkSat’ prototype could help to address 
‘megaconstellation’ threat to astronomy. 


By Alexandra Witze in Honolulu, Hawaii 


he SpaceX company launched 60 of its 
Starlink broadband Internet satellites 
into orbit on 6 January — including one, 
called DarkSat, that is partially painted 
black. The probe is testing one strategy 
to reduce the brightness of satellite ‘mega- 
constellations’, which scientists fear could 
interfere with astronomical observations. 
Various companies plan to launch thousands 
of Internet satellites in the coming years; 
SpaceX, of Hawthorne, California, aims to 
launch 24 batches of Starlinks this year. By the 
mid-2020s, thousands to tens of thousands 
of new satellites could be soaring overhead. 
Bright streaks caused by light reflecting off 
them could degrade astronomical images. 
“Twas complaining to my wife that I can’t 
sleep very well these days because of this,” 
says Tony Tyson, a physicist at the Univer- 
sity of California, Davis, and chief scientist 
of the Vera C. Rubin Observatory, a major 
US telescope under construction in Chile. 
Astronomers discussed the potential impact 
of the satellites on various telescopes, and 
what could be done about the problem, on 
8 January at a meeting of the American Astro- 
nomical Society in Honolulu, Hawaii. “2020 is 
the window to figure out what makes a differ- 
encein reducing the impact,” says Jeffrey Hall, 
director of Lowell Observatory in Flagstaff, 


The diagonal lines in this image show light 
reflected from SpaceX’s Starlink satellites. 
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Arizona, and chair of the society’s committee 
on light pollution. 

“SpaceX is absolutely committed to finding 
a way forward so our Starlink project doesn’t 
impede the value of the research you all are 
undertaking,” Patricia Cooper, SpaceX’s 
vice-president for satellite government affairs, 
told a session at the astronomy meeting. 

Three batches of Starlinks have been 
launched, totalling about 180 satellites so 
far. They are most obvious in the night sky 
immediately after launch, before they boost 
their orbits to higher altitudes where they 
are farther away and look dimmer. It’s not yet 
clear how significant a problem Starlinks will 
be for astronomy, although scientists have 
complained about trails in their images. 

Many astronomers panicked in June, soon 
after SpaceX launched the first batch of 
60 Starlinks and telescopes began photo- 
graphing their trails. Their brightness came 
as asurprise, says Patrick Seitzer, an astrono- 
mer at the University of Michiganin Ann Arbor. 
“The new megaconstellations coming online 
have the potential to be brighter than 99% of 
everything else in Earth orbit, and that’s where 
the concern comes from,” he says. 

Several factors contribute to the puzzling 
brightness, astronomers reported at the 
meeting. SpaceX says the position of the solar 
panels might have something to do with it: at 
lower elevations, before the orbit boost, the 
satellites’ panels are positioned like an open 
book to reduce drag. That temporary orien- 
tation could make them reflect more sunlight. 
The speed at which a satellite moves across a 
telescope’s field of view is also important — 
the more slowly it moves, the more brightness 
accumulates per pixel of imagery. 

There are no regulations that control how 
bright or dimasatelliteneedsto be, notes Ralph 
Gaume, director of the astronomical-sciences 
division of the US National Science Foundation 
in Alexandria, Virginia. 

Calculations suggest that the Starlink 
trails will interfere with astronomy most 
significantly during the hours surrounding 
twilight and dawn. That’s a particular prob- 
lem for observations that need to be made 
during twilight, such as searches for some 
near-Earth asteroids. And on short summer 
nights, the satellite trails could be visible all 
night long. 
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News in focus 


Ecologist Michael Clarke describes 


Australian wildfires’ devastating aftermath 


Australia is in the grip of its worst wildfire 
season on record. The human death toll 
stands at 28, and thousands of homes 

have been destroyed across more than 

10 million hectares of land — an area larger 
than Portugal. An estimated 1 billion wild 
mammals, birds and reptiles have perished. 
Michael Clarke, an ecologist at La Trobe 
University in Bundoora, Melbourne, has 
been studying the effect of fires on native 
ecosystems — and how they recover — 
ever since a fire tore through his field site 
15 years ago. Clarke spoke to Nature about 
how animals fare in the wake of wildfires, 
and why this season’s fires could prove 
particularly devastating. 


What happens in the aftermath of a wildfire? 
It is deathly silent when you go into a forest 
after a fire. Apart from the undertakers — 
the carrion eaters like currawongs, ravens 
and shrike-thrushes — picking off the dead 
bodies, there’s nothing much left in the 
forest. It’s a chilling experience. 

Any animal that manages to make it 
through the fire uninjured faces three major 
challenges. One is finding shelter from 
climatic extremes, the second is the risk of 
starvation. And third, they've got to avoid 
predators. They’re exposed; there’s nowhere 


to hide in a barren landscape. 

Even if an animal makes it to an unburnt 
patch, the density of organisms trying to 
eke out a living will be way beyond the 
area’s carrying capacity. After fires in 2007, 
one unburnt patch | visited in the Mallee [a 
region in the far north of Victoria] was literally 
crawling with birds, all chasing one another, 
trying to work out who owned the last bit of 
turf. It was insufficient to sustain them all. 


Which animals are likely to be most affected? 
Animals like koalas that live above ground in 
small, isolated populations and have a limited 
capacity to flee are in all sorts of trouble. 
During past fires, we’ve seen some really 
surprising creative behaviours, like lyrebirds 
and wallabies going down wombat burrows to 
escape fire. But a large majority of animals are 
simply incinerated. Even really big, fast-flying 
birds like falcons and crimson rosellas can 
succumb to fire. 


How are this season’s fires different from 
those in previous years? 

The scale is unprecedented: such 
synchronous loss of vast areas of habitat. The 
ferocity of the fires, which can create their 
own weather, means the wildlife is at a loss 
to respond. 


Animals that survive the fires, like this wombat pictured in New South Wales, will struggle to 
find food and shelter. 
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The fires are also burning differently to 
how they have in the past. Previously, you 
could rely on wet gullies acting as natural 
barriers to the spread of a fire. This year, with 
the dryness, the fires are just going straight 
through gullies and rainforest pockets that 
would otherwise be places where animals 
could take refuge afterwards. 


How long will it take for ecosystems to 
recover? 

Recovery will probably be slower than for 
previous fires. Re-vegetation depends on 
rainfall, and that’s become so unpredictable. 
Tree hollows and nectar-producing trees, key 
resources for animals, take years or decades 
to recover. 

One concern is the future of several 
migratory bird species that fly between 
Tasmania, Victoria and southern 
Queensland. They make pit stops on the 
coastal heathlands along the east coast, 
precisely where a lot of these fires have 
been happening. It’ll be years before those 
habitats are back in production as stop-over 
points for migratory birds. 

Other animals are being pushed closer to 
extinction. There are real concerns for the 
brush-tailed rock wallaby, and the Kangaroo 
Island dunnart — a small marsupial — which 
has lost nearly all of its habitat. 


What does the future hold for these 
ecosystems? 

The challenge will be to work out how to 
protect the pockets of forest habitat that 
are left. We may need to be proactive and 
carry out controlled burns near areas that 
become wildlife refuges during fires to keep 
future fires out. That doesn’t sit well with 
me, but it may be the new normal that we're 
facing. 

These fires are unprecedented but not 
unexpected. Thirty years ago, scientists 
predicted that there would be more severe 
fires due to climate change. We are seeing 
three big changes: increased frequency of 
fire, increased severity, and increased extent 
of these blazes. That triple whammy reduces 
fauna’s capacity to recover. 


Interview by Dyani Lewis 
This interview has been edited for length and 
clarity. 
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An existing 64-dish array in South Africa is set to become part of the Square Kilometre Array. 


WORLD'S LARGEST RADIO 
TELESCOPE NEEDS TO HIT 
USS1-BILLION TARGET 


Countries won't say how much they have pledged tothe 
Square Kilometre Array in Australia and South Africa. 


By Sarah Wild 


nindependent panel of reviewers has 
approved the design of what will be 
the world’s largest telescope — the 
Square Kilometre Array (SKA). But the 
final design and construction time- 
table depend on whether the SKA collaboration 
can raise its first €940 million (US$1 billion). 

The amount that countries have already 
committed to the SKA is confidential, but itis 
understood to be less than is needed for con- 
struction to begin on schedule in 2021. When 
asked, SKA director-general Philip Diamond 
would not say how much money the project 
expects to have in the bank when building 
starts, but SKA officials say that pledges must 
be confirmed by the middle of 2020. 

The design was agreed by the review panel 
last month. “We are aggressively seeking the 
full funding commitment,” Diamond says. But 
he adds that if sufficient funds are not pledged 
intime, the collaboration will turn toa scaled- 
back design costing €691 million. This design 
has reduced computing power, and its dishes 
and antennas are squeezed closer together. 

If the full plan is achieved, the SKA will be 
able to create images 50 times more detailed 
than those produced by current instruments, 


such as the Hubble Space Telescope, and will 
shed light onsome of the most enigmatic prob- 
lems in astronomy and science, such as the 
nature of dark matter and how galaxies form. 

The gigantic telescope will be able to detect 
signals from the ‘epoch of reionization’, when 
the first stars and galaxies began to give off 
light, says Francisco Colomer, director of the 
Joint Institute for Very Long Baseline Inter- 
ferometry European Research Infrastructure 
Consortium in Dwingeloo, the Netherlands, 
which coordinates a network of radio tele- 
scopes in 13 countries. 


Budgetary barriers 

The SKA will be built in stages, but once 
complete, it will include around 2,000 radio 
dishes in 9 African countries and up to one 
million antennas in Australia. The array will 
have a total signal-collecting area of roughly 
1square kilometre, hence its name. Designs for 
the first phase — knownas SKA1— account for 
roughly 10% of the finished project, and will 
include 194 dishes in South Africa, along with 
about 130,000 antennas in Australia. 

The projectis being coordinated by tenmem- 
ber governments: those of Australia, Canada, 
China, India, Italy, Spain, South Africa, Sweden, 
the Netherlands and the United Kingdom, as 
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well as a group of French research organiza- 
tions and Germany’s Max Planck Society. 

But it’s not clear how much of the $1-billion 
construction cost has already been raised. The 
United Kingdom — which houses the project’s 
headquarters at the Jodrell Bank Observatory 
near Manchester — has contributed £119 mil- 
lion ($154 million) to the SKA project so far, 
and other countries have also provided funds. 

Around 100 organizations from 20 countries 
have worked on prototype dish and antenna 
designs, as well as the information-technology 
infrastructure needed to collect data. SKA 
spokesperson William Garnier says the 
amounts that these countries have contrib- 
uted are confidential for now, but he confirms 
that the construction funds will need to be 
banked before the collaboration can award 
building contracts. 


Costs of cutbacks 


Astronomers have voiced concerns that — if 
the project cannot raise its first $1 billion — the 
scaled-back plan would reduce the telescope’s 
sensitivity. But Diamond told Nature that this 
design can be upgraded if more money comes 
through — orifmore countries become part of 
the consortium. 

Extra countries have joined the project since 
it officially kicked off in 2013. But of the found- 
ing partners, the German government left the 
project in 2014 and New Zealand pulled out in 
July last year, the latter because officials were 
not convinced that its researchers would gain 
enough benefits from the project to justify the 
membership cost, estimated at NZ$40 million 
(US$26 million) over 10 years. 

The telescope’s budget and its start date 
have changed before, and it is possible that 
the 2021 target might be revised again. This 
is because SKA member countries have not 
yet finished creating the intergovernmen- 
tal SKA Observatory, which will be the legal 
entity responsible for managing the project 
and collecting and spending funds. 

The SKA Observatory must be established 
before governments can formally begin 
depositing project funds. For this to happen, 
at least five member countries — including 
Australia, South Africa and the United King- 
dom, whichall host parts of the project — must 
each pass a law pledging that the country will 
become part of the body. Australia, China, 
Italy, the Netherlands, Portugal, South Africa 
and the United Kingdom have all agreed to 
enact relevant legislation, but so far only the 
Netherlands has done so. 

Asource who has knowledge of the project, 
but who asked not to be named, is confident 
that the SKA will be built despite delays 
and fluctuating costs, and that it will be an 
impressive instrument irrespective of the 
chosen design. “SKA is too big to fail: there’s 
too much involvement at a senior level, too 
much money,’ they say. 
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News in focus 


US DATA SUGGEST 
HUMAN BODIES ARE 


COOLING DOWN 


Normal body temperatures area fraction of a degree 
colder than they were in the nineteenth century. 


By Ewen Callaway 


he human body is getting colder. Since 
the nineteenth century, normal body 
temperatures have dropped by a frac- 
tion of a degree, finds a study’ that 
looked at more than 150 years of data. 

People’s bodies are now, on average, cooler 
than the textbook figure of 37 °C, having fallen 
by a few hundredths of a degree per decade, 
estimates a team led by Julie Parsonnet, an 
infectious-disease epidemiologist at Stanford 
University in California (M. Protsiv et al. eLife 
9, 49555; 2020). 

German physician Carl Reinhold August 
Wunderlich determined the 37 °C figure 
in 1851. But later research found that the 
average body temperature is slightly cooler. 
A 2017 study of some 35,000 people found a 


CHILLING EFFECT 


Average normal body temperatures have dropped 
by about 0.03 °C per decade since the nineteenth 
century, according to a study of hundreds of 
thousands of temperature measurements taken 
from people in the United States. 
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temperature of 36.6 °C (Z. Obermeyer et al. 
Br. Med. J. 359, j5468; 2017). Scientists sus- 
pected that measurement error explained 
the discrepancy. 

But Parsonnet says her team’s data and mod- 
elling suggest bodies really are cooling. The 
researchers looked at three data sets — 83,900 
temperatures taken between 1862 and 1930 
from American Civil War veterans, and hun- 
dreds of thousands of measurements collected 
in the 1970s and between 2007 and 2017. 

The team found that women bornin the first 
decade of the nineteenth century had temper- 
atures 0.32 °C higher than those of women 
bornin the late 1990s; for men, the difference 
was 0.59 °C. Overall, temperatures dropped 
by 0.03 °C per decade (see ‘Chilling effect’). 

Parsonnet thinks that lower rates of long- 
term infections such as tuberculosis, which 
can elevate body temperature, explains the 
trend. In the nineteenth century, chronic 
infections were common, she says. 

That suggestion is “intriguing and plausible”, 
says Jill Waalen, an epidemiologist at the 
Scripps Research Translational Institute in 
La Jolla, California. But none of the measure- 
ments the researchers used came from the 
period beginning in the 1940s, when antibiot- 
ics were widely introduced. Waalen says that a 
marked dropin body temperatures at this time 
would support Parsonnet’s infection theory. 
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The Best Antibody Discovery Technology 
Is Now at Your Fingertips 


The Trianni Mouse™ platform is a transgenic antibody discovery platform 
offering the entire human variable gene diversity in a single organism. 


The V(D)J gene segments in The Trianni Mouse are chimeric, but the 
variable domains of antibodies made by the mouse are entirely human. 
The result is human antibody leads generated from antibody genes 


optimized for function in the mouse. 


To learn more about this innovative platform, visit Trianni.com. 


Heavy Chain CDR3 Compositions of a Human Individual and a Trianni Mouse are Almost Identical 
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FINDING THE CRISPR OFF-SWITCH 


Researchers have identified a bevy of anti-CRISPR tools that 
could aid in medicine and biosecurity. By Elie Dolgin 


t started out as “sort of a stupid thing to 
do”, recalls Joe Bondy-Denomy, a micro- 
biologist at the University of California, 
San Francisco. As a graduate student in 
the early 2010s, he tried to infect bacteria 
with viruses that, on paper, shouldn’t 
have stood a chance. He knew that these 
viruses, or phages, were susceptible to 
CRISPR-Cas, the bacterial defence system that 
scientists have harnessed as a powerful tool for 
gene editing. And in most cases, he was right: 
the CRISPR machinery chopped the incoming 
phages into bits. Butin afew instances, against 
the odds, the intruders survived. 
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Bondy-Denomy thought he had messed up. 
“Then a light bulb went off,” he says. Maybe 
something inside the bacterial genome was 
disarming its defences. And maybe that 
self-sabotaging bit of DNA was coming from 
previous viral invaders. 

A quick comparison of DNA sequences 
proved Bondy-Denomy’s intuition correct. 
Phage genes nestled inside the bacterial 
genome were completely shutting down the 
CRISPR-Cas system, making the bacteria 
vulnerable’. 

“Joe got the result that changed everything,” 
says Alan Davidson, a phage biologist at the 
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University of Toronto in Canada, who was 
Bondy-Denomy’s PhD adviser at the time. 
“He found something amazing that we never 
expected.” 

Bondy-Denomy — together with Davidson, 
microbiologist Karen Maxwell and fellow 
graduate student April Pawluk — had stum- 
bled onto tools now known as anti-CRISPRs. 
These proteins serve as the rocks to CRISPR’s 
molecular scissors. And soon, they were 
popping up everywhere: more than 50 
anti-CRISPR proteins have now been charac- 
terized, each with its own means of blocking 
the cut-and-paste action of CRISPR systems. 


ILLUSTRATION BY SEBASTIEN THIBAULT 


The expansive roster opens up many 
questions about the archaic arms race between 
bacteria and the phages that prey on them. 
But it also provides scientists with a toolkit for 
keeping gene editing in check. 

Some are using these proteins as 
switches to more finely control the activity 
of CRISPR systems in gene-editing appli- 
cations for biotechnology or medicine. 
Others are testing whether they, or other 
CRISPR-stopping molecules, could serve as 
biosecurity countermeasures of last resort, 
capable of reining in some genome-edited 
bioweapon or out-of-control gene drive. 

“For any reason you can think of to turn off 
CRISPR systems, anti-CRISPRs come into play,” 
says Kevin Forsberg, a microbial genomicist at 
the Fred Hutchinson Cancer Research Center 
in Seattle, Washington. 

Yet, despite a growing number of proposed 
applications and proof-of-concept experi- 
ments in the laboratory, researchers have yet 
to pin down the therapeutic potential of these 
anti-CRISPR systems. Jennifer Doudna, a bio- 
chemist at the University of California, Berkeley, 
and one of the pioneers of CRISPR gene editing, 
voices a question that she says is oneveryone’s 
lips: “How do you actually use these ina way that 
will provide meaningful control?” 

“That’s certainly where that whole 
anti-CRISPR field needs to go,” she says. “It 
just hasn’t gone there yet.” 


Allhell breaks loose 


Despite the growing focus on anti-CRISPRs 
— with about one paper a week published on 
the topic in 2019 — the initial discovery by 
Davidson and his students flew under the radar. 

To most scientists, it seemed like an esoteric 
example of evolutionary warfare — especially 
given that the anti-CRISPR proteins discov- 
ered were all specific to one particular form of 
bacterial defence, knownas the typeI CRISPR 
system. The darling of genome editing has 
been the type II system and its archetypal 
DNA- cutting protein, Cas9. 

“For the wider biological audience to really 
take notice,” says Pawluk, now an editor at Cell, 
“it had to be Cas9”. 

In December 2016, Pawluk, still working in 
Davidson’s lab, and Bondy-Denomy, leading 
his own independent research group, each 
identified inhibitors to the Cas9 enzyme’. 
This time, researchers around the world seized 
on the findings. “Like everything else in the 
CRISPR world, the thin edge of the wedge 
comes in, and the next thing you know all hell 
breaks loose,” says Erik Sontheimer, amolecu- 
lar biologist at the University of Massachusetts 
Medical School in Worcester and a co-author 
on Pawluk’s paper’. 

In less than three months, structural 
biologists at the Harbin Institute of Technol- 
ogy in China had deciphered the molecular 
mechanism by which one of Bondy-Denomy’s 


anti-CRISPR proteins, called AcrIIA4, shut 
off Cas9 activity* (see ‘CRISPR correctives’). 
A few months later, Doudna, working with 
Bondy-Denomy and biochemist Jacob Corn, 
nowat the Swiss Federal Institute of Technology 
in Zurich, offered the first demonstration that 
anti-CRISPRs had practical value: they showed 
that delivering AcrIIA4 into human cells, either 
alongside or right after introducing Cas9, could 
halt gene-editing activity in its tracks. 

That’s useful, because if Cas9 remains active 
for too long, it raises the risk of unintended 
edits. Doudnaand her collaborators reported 
that an anti-CRISPR protein could limit the 
‘off-target’ effects that researchers and inves- 
tors have fretted over since early in CRISPR’s 
development. 


Aa 


JOE GOT THE RESULT 
THAT CHANGED 
EVERYTHING. HE FOUND 
SOMETHING AMAZING 
THAT WE NEVER 
EXPECTED. 


Curbing off-target activity would be a big 
contribution to the field of CRISPR therapeu- 
tics, says David Rabuka, chief executive and 
cofounder with Bondy-Denomy of Acrigen 
Biosciences, based in Berkeley. The company’s 
pitch: “We're going to make gene editing more 
efficient and safer,” Rabuka says. 

Anti-CRISPRs could also help to confine 
editing activity to particular cells and tissues in 
the body. In 2019, research teams in Germany, 
Japan and the United States independently 
attempted to use the proteins in tandem with 
small regulatory molecules called microRNAs 
to bring about tissue-specific editing® ®. The 
US team, led by Sontheimer, even showed that 
the approach could workin mice — theirsis the 
only published study so far to demonstrate 
that anti-CRISPR proteins can work ina living 
animal, and not just cells®. 

Sontheimer and his colleagues wanted to 
allow editing in the liver while suppressing 
it in all other tissues of the mouse. So they 
designed an anti-CRISPR protein that would 
be active everywhere except in the presence 
of microRNA-122, which is found only in the 
liver. Inthe mice, the anti-CRISPR successfully 
blocked Cas9 editing throughout the body, 
except in that one organ. 

Although the paper focused on liver- 
directed editing, the platform is “plug and 
play”, says Sontheimer: any organs that pro- 
duce a unique microRNA at high expression 
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levels could be targeted in this way, provided 
that the anti-CRISPR proteins don’t trigger 
unwanted immune effects. 


Not immune to challenges 


Because of previous exposure to microbes 
harbouring CRISPR-Cas systems, many peo- 
ple have immune systems that are already 
primed to attack and disable the Cas9 protein. 
That could pose a challenge. In mice, just one 
dose of a CRISPR-based medicine can elicit a 
strong enough immune response to render 
subsequent treatments ineffective. 

According to Sontheimer, anti-CRISPR 
proteins could be prone to the same rejection 
issue, potentially imperilling the technology 
and triggering dangerous, inflammatory reac- 
tions in patients. 

Other types of CRISPR inhibitor shouldn't 
have the same limitation. Last May, a team 
led by Amit Choudhary, a chemical biologist 
at the Broad Institute of MIT and Harvard 
in Cambridge, Massachusetts, described a 
new way of identifying small-molecule drugs 
capable of disrupting Cas9 activity’. The com- 
pounds his team identified are not as potent 
as natural anti-CRISPR proteins, but they are 
more likely to sneak past the immune system, 
tocross cell barriers and to allow for reversible 
control of Cas9 activity. 

Elsewhere, researchers have designed short 
strings of nucleic acids that grab ontotwo parts 
of the Cas9 complex and completely shut down 
gene editing in human cells”. “We're pretty sure 
that what we have works better than all the best 
anti-CRISPR proteins out there already,” says 
Keith Gagnon, an RNA biochemist at Southern 
Illinois University in Carbondale who led the 
research. And other groups, including virol- 
ogist Brooke Harmon’s at Sandia National 
Laboratories in Livermore, California, have 
synthesized tiny protein fragments that show 
potential as anti-CRISPR agents. “It’s nice to 
have alot of different options,” Harmon says. 

That diversity could be important in medical 
applications: for example, in limiting the 
editing activity of gene-targeted medicines, 
or fashioning phage therapies capable of 
wiping out difficult-to-treat bacteria without 
being stymied by the pathogen’s own CRISPR 
defences. It might also help in other proposed 
applications of CRISPR-blocking technologies. 

Take gene-drive systems, in which scientists 
deploy CRISPR gene editing to spread a DNA 
modification swiftly through an entire popu- 
lation. Some public-health officials hope that 
the technique might allow for the complete 
eradication of disease-carrying mosquitoes 
or ticks, for example. 

But concerns over unforeseen ecological 
impacts abound. Many public officials and 
researchers also worry about gene drives being 
weaponized to wipe out agricultural systems or 
to spread a deadly disease. 

Anti-CRISPRs could provide a molecular 
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safety net against these potential bio-attacks, 
says Sandia biochemist Joe Schoeniger. “You 
need to have an off-button,” he says. 

For now, such applications are mostly 
hypothetical. The only published report of 
researchers using anti-CRISPR proteins to 
inhibit a gene drive comes from a proof-of- 
principle experiment in yeast". However, 
the idea is gaining traction, including among 
researchers hoping to halt the spread of malaria 
by forcing harmful genes to spread throughan 
entire population of mosquitoes. 

Andrea Crisanti, amolecular parasitologist 
at Imperial College London, says that he has 
used anti-CRISPR genes to halt a mosquito- 
eradicating gene-drive system. The gene drive, 
which disrupts female fertility, can wipe out 
mosquitoes in the lab in about ten genera- 
tions”. But in unpublished work, his team has 
added anti-drive mosquitoes to the mix, and 
“they can completely, 100% block the drive”, 
Crisanti says. “We can stop the population from 
crashing.” 


Insurance policy 


As Crisanti looks ahead to field-testing his 
sterilization strategy, he imagines having cages 
of anti-drive mosquitoes at the ready, just in 
case things go awry. “It’s kind of like buying an 
insurance,” he says. 

But the need for CRISPR containment goes 
beyond gene drives. “If there’s an adverse event 
inaclinical trial or anefarious use of agenome 
editor, we’re not going to know what that looks 
like until it happens,” says Renee Wegrzyn, a 
biosecurity scientist at the US government’s 
Defense Advanced Research Projects Agency 
(DARPA) in Arlington, Virginia. 

That’s why DARPA, in 2017, launched the Safe 
Genes programme, afour-year, US$65-million 
initiative aimed at combating the dangers of 
CRISPR technologies. This has involved dis- 
covering new inhibitors against all types of 
CRISPR-Cas system and finding anti-CRISPRs 
that function in unique and useful ways. 
Bondy-Denomy, Choudhary, Crisanti, Doudna 
and the Sandia team, among others, are all 
recipients of this funding. 

Beyond its biotechnology applications, the 
anti-CRISPR strategy is opening up fresh pos- 
sibilities for basic research, too. “It’s become 
one of our favourite tools,” says Shawn Liu, a 
neuro-epigeneticist at Columbia University 
Medical Center in New York City. Liu studies 
how a modified CRISPR-Cas9 system can 
change the expression levels of agene through 
epigenetic modifications — that is, without 
altering the underlying sequence. Anti-CRISPR 
proteins helped him to show how long the 
effects lasted. 

They also came in handy when researchers 
were looking for mutant strains of bacteria 
that could fend off phage attacks more effec- 
tively than standard ones. A team led by Sylvain 
Moineau, a phage biologist at Laval University in 
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CRISPR CORRECTIVES 


There are two particularly well-studied types of CRISPR DNA editing. Type | uses a Cascade complex and guide 
RNA to bind a DNA target, which is then cut with the Cas3 enzyme. Type II uses a single enzyme, such as Cas9, 
to bind and cut the target sequence. Researchers have discovered more than 50 anti-CRISPR (Acr) proteins 
that turn off DNA-editing activity in a variety of ways. Here are two commonly observed mechanisms. 


DNA-binding inhibition 
Some Acr proteins 
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binding target DNA. 
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DNA-cleavage inhibition 
Some Acr proteins 
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cutting action. 
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Quebec City, Canada, focused on Streptococcus 
thermophilus, a microbe used to make cheese 
and yoghurt"*: “We used a phage containing 
an anti-CRISPR protein as a tool to find other 
defence mechanisms,” he explains. 

Other scientists are incorporating 
anti-CRISPRs into tools such as biosensors 
that can track how much of atherapeutic gene 
editor is active inside cells, and optogenetic 
control strategies that allow researchers to 
switch on Cas9 genome targeting at the flick 
ofalaser beam. 

“A lot ofitis stillin the stage of ‘toy’ systems,” 
says Chase Beisel, a bioengineer at the 
Helmholtz Institute for RNA-based Infection 
Research in Wiirzburg, Germany. “But the 
concept is there, at least.” 


Open questions 


As bioengineers continue to tinker with 
anti-CRISPRs, and as companies such as 
Acrigen move to introduce the inhibitors into 
therapeutic platforms, some biologists have 
also begunto grapple with more philosophical 
questions about the evolution of CRISPR-Cas 
systems inthe first place. If bacteria with intact 
CRISPR protections commonly harbour 
phage-derived sequences for inhibitors that 
neutralize this immunity, then “CRISPR is 
clearly not doing its defence role in many of 
those cases”, says Edze Westra, who studies the 
ecology of CRISPR systems at the University of 
Exeter, UK. And yet, natural selection seems to 
maintain the system in working order. So, he 
asks, “what is its role apart from fuelling biotech 
start-up companies?” 

Some studies point to bacteria using CRISPR- 
Cas systems in forming biofilms, repairing DNA 
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and conducting other regulatory processes 
involved in enhancing virulence. And it’s pos- 
sible that once anti-CRISPRs have defanged Cas 
enzymes of their DNA-cutting abilities, bacteria 
will have repurposed the gene editors for other 
uses, says Maxwell, the University of Toronto 
microbiologist. 

Those bedevilling mysteries won't halt the 
steady march of CRISPR gene editing into 
human therapeutics, pest control and more. 
And for many, that’s why anti-CRISPRs are so 
important. 

“There needs to be this shift to really 
controlling these editors so we make sure 
that you get the change you want and nothing 
else,” says Doudna. And just as the CRISPR-Cas 
systems that ushered ina biotechnology revolu- 
tion started with afew curious observationsina 
laboratory, she notes, so too did the discovery 
of the inhibitors that could be amuch-needed 
corrective. 


Elie Dolgin is a science journalist in 
Somerville, Massachusetts. 
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US president Ronald Reagan based his tax cuts of the 1980s on questionable ideas. 


The dangers of 
fringe economics 


Governments need to heed the evidence. Three books 
warn of the perils of untested ideas. By Ehsan Masood 


n November 2017, the economist Vera 
Shlakman died at the age of 108. Her 
1935 Economic History of a Factory Town 
is a landmark in the field. Chronicling 
how textile manufacturing transformed 
Chicopee, Massachusetts, Shlakman zeroed 
inon working women’s lives, vaulting beyond 
analyses of data on wages and shift lengths to 
include the value of dowries and information 
in letters and diaries. Ousted from teaching 
economics during the McCarthy era of the 
1950s, she never published another book. 


I thought of Shlakman, and how far we 
have strayed from such integrated analyses 
of economic realities, while combing through 
Simon Bowmaker’s 2019 When the President 
Calls. Over the past half-century, Bowmaker 
shows, economic advisers to US presidents 
from Richard Nixon to Donald Trump 
have enabled central bankers and treasury 
officials to implement untethered ideas. 
Often described in terms borrowed from 
mathematics or physics (suchas the ‘velocity 
of money’), these concepts neither command 
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an expert consensus nor are they necessarily 
reproducible. 

Two other new books, both by economics 
Nobel laureates, also capture the spirit of Shlak- 
man’s diverse thinking: Paul Krugman’s Arguing 
with Zombies, and Good Economics for Hard 
Times by Abhijit Banerjee and Esther Duflo. 

Banerjee and Duflo’s book appeared a 
month before they were awarded the 2019 
Nobel, which they shared with Michael 
Kremer. It encapsulates nearly two decades 
of research bringing field trials of policies in 
low-income countries into the mainstream 
— from improving educational outcomes, 
to uptake of vaccination. Krugman’s tome, 
meanwhile, mulls over the mistakes of the 
past two decades. The ‘zombies’ of his title 
are economic theories and policies that 
should have been killed by evidence, but keep 
coming back — suchas the idea that inequality 
isnecessary for growth. Ina world still reeling 
from the impact of the 2008 financial crisis, 
Krugman (who was awarded his Nobel that 
year) has harsh words for practitioners cling- 
ing to the old ways. 
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Abhijit Banerjee and Esther Duflo advocate field trials of economic policies, notably in low-income nations. 


A prominent example is the ‘Laffer curve’, 
named after Arthur Laffer, who later became 
Ronald Reagan’s economic adviser. Bowmaker 
clearly relished the chance to quiz Laffer, 
who is reported to have sketched the idea 
out on a napkin over a 1974 lunch with two 
White House officials: Donald Rumsfeld and 
Dick Cheney. If governments raise taxes for 
people with modest incomes and cut taxes 
for the wealthy, Laffer argued, they can raise 
revenue and boost growth. The former, 
because lower-income earners far out- 
number the rich, thus amplifying their total 
tax contribution; the latter, because wealthy 
business owners are likely to use cash saved 
from tax cuts to invest in new products, more 
jobs or equipment, thus boosting growth. 


Untested tax cuts 


There is no consensus on whether the Laffer 
curve is accurate — even some leading 
conservative economists, such as Gregory 
Mankiw, are critical. Yet it became the basis for 
tax cuts, beginning with Reagan’s decision to 
slash the top rate of income tax from 70% down 
to 28% over the early to mid-1980s. Bowmaker 
found Laffer still buoyant. “That’s my baby and 
I just loved it,” Laffer said. “It was the best tax 
bill in US history.” Trump awarded him the 
Presidential Medal of Freedom in 2019. 

As these three books reveal, however, the 
ideas that Laffer and others came to represent 
are nowunder severe pressure, even from the 


312 | Nature | Vol577 | 16 January 2020 


When the President Calls: Conversations with 
Economic Policymakers 

Simon W. Bowmaker 

MIT Press (2019) 


Arguing with Zombies: Economics, Politics, 
and the Fight for a better Future 

Paul Krugman 

W. W. Norton (2020) 


Good Economics for Hard Times 
Abhijit V. Banerjee and Esther Duflo 
PublicAffairs (2019) 


centre-right parties in high-income nations 
that initially backed them. Decades of fall- 
ing or flat public spending, unrestricted free 
trade, relatively light regulations on financial 
institutions and low taxes on businesses and 
top earners have not translated into across- 
the-board prosperity. That is seen especially 
in the United States and Britain, now among 
the most unequal countries in their peer 
group. In 2016, six out of northern Europe’s 
ten poorest regions, as measured by gross 
domestic product per person, were in the 
United Kingdom. 

Economic nationalism has emerged from 
these trends, under slogans ranging from 
“America First” to the UK “Take Back Control”. 
The real results are the Trump administration’s 
public disavowal and renegotiation of the 
North American Free Trade Agreement — and 
Brexit. 
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At the same time, the collapse of what were 
once seen as mainstream economic ideas 
by politicians right and left has opened up 
space for more conventionally green-left 
approaches to policymaking. That explains 
in part why the Italian American economist 
Mariana Mazzucato is being heard across 
political divides. In books such as The Value 
of Everything (2018), Mazzucato makes a 
strong case for the state as enabler in an 
economic policy that privileges well-being 
and sustainability. That is also the space 
into which Banerjee and Duflo enter in Good 
Economics for Hard Times. Like Krugman, 
they are critical of policies based on weak or 
non-existent evidence. But their approach is 
less argument than stepping back to let the 
research do the talking. 


Consensus and controversy 


The studies they cite probe hot topics suchas 
climate change, immigration and the viability 
of continued economic growth. Banerjee 
and Duflo synthesize the literature on what 
is agreed and what is controversial in an 
accessible, often entertaining way. There are 
gaps, however. Mentioning the work of Kate 
Raworth and Tim Jackson on the environmen- 
tal impacts of constant growth, and of Partha 
Dasgupta on the value of biodiversity, would 
have enriched and unified their thinking on 
the impact of consumption-fuelled growth 
on climate change, and on biodiversity and 
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ecosystem services. 

Grand narratives, which smack of the old 
economic thinking, are not the goal of Banerjee 
and Duflo. The authors do, however, need to 
articulate their approach in narrative terms, 
or they will struggle to be heard. As Bowmaker 
demonstrates, US presidents want to hear 
happy endings, rather than sit through amenu 
of options communicated seminar-style. 
Harry Truman is reported to have said that he 
preferred one-handed economists, because he 
didn’t like hearing “on the other hand”. Barack 
Obama was an exception, making decisions 
after hearing arguments pro and con. 


“There is strengthin 
numbers and robustness 
in diversity.” 


Sadly, neither Arguing with Zombies nor 
Good Economics for Hard Times tackles in 
depth what I feel is the defining challenge 
for newer generations of economic policy 
advisers. That is, how to mitigate the risks 
of expert-shopping by policymakers. If 
researchers with fringe ideas continue to 
validate untested theories, yet more zombies 
will invade the corridors of power. 

This happens to scientific advisers, too — 
although perhaps less often. In the 1990s, gov- 
ernments with significant oil and gas interests 
joined the powerful fossil-fuel industry lobby 
in seeking experts who could cast doubt on 
human influence on climate change. The 
consensus view of the Intergovernmental 
Panel on Climate Change — backed as it was by 
a huge number of heavyweight researchers — 
was essential in preventing such dubious ideas 
from penetrating the mainstream (although it 
has not stopped Trump’s withdrawal from the 
2015 Paris climate agreement). 

Economists need to organize similarly 
across different schools of thought — and to 
include development economics as well as 
ecological and environmental economics 
and feminist economics. There is strength in 
numbers and robustness in diversity. That can 
go some way towards curbing the unworkable 
concepts that continue to emerge. 

As these three thoughtful, timely books 
demonstrate in their own ways, a space has 
opened up for newideas in economics at atime 
of widespread inequality, social and cultural 
schisms, and environmental crisis. That is 
an opportunity to avoid another 50 years 
of theories that inform the highest levels of 
policy as if evidence didn’t matter. 


Ehsan Masood is Editorials editor and bureau 
chief for the Middle East and Africa at Nature 
in London. His latest book is The Great 
Invention: The Story of GDP and the Making 
and Unmaking of the Modern World. 
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Dark Data 

David J. Hand Princeton Univ. Press (2020) 

We are deluged with billions of bytes of data, yet much crucial 
information goes unseen and unreported. So reveals statistician 
David Hand in this penetrating study of missing (‘dark’) data and its 
impacts on decisions — skewing stats, enabling fraud, embedding 
inequity and triggering preventable catastrophes. Advocating “data 
science judo”, Hand offers expert training, from recognizing when 
facts are being cherry-picked to designing randomized trials. A book 
illuminating shadowed corners in science, medicine and policy. 
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DELUSION 


The Surprising Science of 
How We Are Connected 
and Why That Matters 


TOM OLIVER 


The Self Delusion 

Tom Oliver Weidenfeld & Nicolson (2020) 

Humans are less discrete entities than mash-ups of microbiota and 
shifting beliefs, declares ecologist Tom Oliver in this rich, intriguing 
book. We are, he shows, so interfused with the environment that all 
life might be seen as a web of genes, and all minds a web of memes. 
Oliver reframes the self as a fleeting union of molecules, a target 

for manipulation by parasites, a cooperative co-creator who is also 
destroying the biosphere. But by recognizing our connectedness, he 
argues, we enable needed societal and environmental change. 
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Uncanny Valley 

Anna Wiener Farrar, Straus and Giroux (2020) 

Start-ups have long been seen as a geek-driven, idealistic antidote 

to corporate culture. Anna Wiener’s unsettling memoir may muddy 
that image. In 2013, a 20-something Wiener was drawn to the digital 
economy of California’s Silicon Valley. Soon enough she recognized it 
as a reckless, male-dominated world of barely regulated surveillance. 
She witnessed the boom in online abuse and political trolling from 
the inside, and the growing inequity in San Francisco fuelled by 
venture capitalists. An acute eye on a dystopia in the making. 


CHIP WALTER 


IMMORTALITY ING 


Immortality, Inc. 

Chip Walter National Geographic (2020) 

Extreme longevity might seem a seductive concept to some. To 

a handful of prominent researchers, it’s an experimental goal. 
Venturing into that rarefied world, journalist Chip Walter interviewed 
stars such as biotechnologist J. Craig Venter and X Prize founder 
Peter Diamandis. Their eventful stories are woven through Walter's 
tour of biotech research centres Calico and Celularity, and fields 
from cryopreservation to regeneration. Results remain broadly 
inconclusive, but this witty look at ‘curing’ death is worth the ride. 


» EXPLODING 
STARS and 
INVISIBLE 


+. 
THE SCIENCE OF WHAT'S OUT THERE 
Fred Watson 


Exploding Stars and Invisible Planets 

Fred Watson Columbia Univ. Press (2020) 

Astronomer Fred Watson is a science communicator par excellence. 
Here, with infectious enthusiasm, he plunges the reader into the 
science on sky-watching and space observation. Kicking off with 

a nuanced discussion of twilight — covering everything from 
crepuscular rays to the ‘green flash’ — he moves on to meteor 
showers, the potential contamination of the Solar System’s ice moons 
by earthly microbes, the mystery of a hypothesized Planet Nine and 
the real origins of the Moon. Barbara Kiser 
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Setting the agenda in research 


Comment 


Time forthe Human 
Screenome Project 


Byron Reeves, Thomas Robinson & Nilam Ram 


To understand how people 
use digital media, researchers 
need to move beyond screen 
time and capture everything 
we do and see on our screens. 


here has never been more anxiety 

about the effects of our love of 

screens — which now bombard us with 

social-media updates, news (real and 

fake), advertising and blue-spectrum 
light that could disrupt our sleep. Concerns are 
growing about impacts on mental and physical 
health, education, relationships, even on pol- 
itics and democracy. Just last year, the World 
Health Organization issued new guidelines 
about limiting children’s screen time; the 
US Congress investigated the influence of 
social media on political bias and voting; and 
California introduced alaw (Assembly Bill 272) 
that allows schools to restrict pupils’ use of 
smartphones. 

All the concerns expressed and actions 
taken, including by scientists, legislators, 
medical and public-health professionals and 
advocacy groups, are based on the assump- 
tion that digital media — in particular, social 
media — have powerful and invariably negative 
effects on human behaviour. Yet so far, it has 
been a challenge for researchers to demon- 
strate empirically what seems obvious expe- 
rientially. Conversely, it has also been hard for 
them to demonstrate that such concerns are 
misplaced. 

A major limitation of the thousands of 
studies, carried out over the past decade or 
so, of the effects of digital mediais that they do 
not analyse the types of data that could reveal 
exactly what people are seeing and doing on 
their screens — especially in relation to the 
problems that doctors, legislators and par- 
ents worry most about. Most use self-reports 
of ‘screen time’. These are people’s own esti- 
mates of the time they spend engaging with 
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screens or with platforms that are categorized 
as ‘smartphone; television’, ‘social media’, 
‘political news’ or ‘entertainment media’. Yet 
today’s media experiences defy such simplis- 
tic characterization: the range of content has 
become too broad, patterns of consumption 
too fragmented’, information diets too idio- 
syncratic’, experiences too interactive and 
devices too mobile. 

Policies and advice must be informed by 
accurate assessments of media use. These 
should involve moment-by-moment cap- 
ture of what people are doing and when, 
and machine analysis of the content on their 
screens and the order in which it appears. 

Technology now allows researchers to 
record digital life in exquisite detail. And 
thanks to shifting norms around data sharing, 
and the accumulation of experience and tools 
in fields suchas genomics, it is becoming eas- 
ier to collect data while meeting expectations 
and legal requirements around data security 
and personal privacy. 

We call for a Human Screenome Project 
—acollective effort to produce and analyse 
recordings of everything people see and do 
on their screens. 


Screen time 


According to a 2019 systematic review and 
meta-analysis’, over the past 12 years, 226 stud- 
ies have examined how media useis related to 
psychological well-being. These studies con- 
sider mental-health problems suchas anxiety, 
depression and thoughts of suicide, as well 
as degrees of loneliness, life satisfaction and 
social integration. 

The meta-analysis found almost no 
systematic relationship between people’s 
levels of exposure to digital media and their 
well-being. But almost all of these 226 stud- 
ies used responses to interviews or question- 
naires about how long people had spent on 
social media, say, the previous day. 

The expectation is that if someone reports 
being on Facebook a lot, then somewhere 
among all those hours of screen time are the 
ingredients that influence well-being, for 
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better or worse. But ‘time spent on Facebook’ 
could involve finding out what your friends 
are doing, attending a business meeting, 
shopping, fundraising, reading a news arti- 
cle, bullying, even stalking someone. These 
are vastly different activities that are likely to 
have very different effects ona person’s health 
and behaviour. 

Another problem is that people are 
unlikely to recollect exactly when they did 
what*». Recent studies that compared sur- 
vey responses with computer logs of behav- 
iour indicate that people both under- and 
over-report media exposure — often by as 
much as several hours per day® ®. In today’s 
complex media environment, survey ques- 
tions about the past month or even the past 
day might be almost useless. How many times 
did you look at your phone yesterday? 

The US National Institutes of Health (NIH) is 
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currently spending US$300 million ona vast 
neuroimaging and child-development study, 
eventually involving more than 10,000 chil- 
dren aged 9 and 10. Part of this investigates 
whether media use influences brain and cog- 
nitive development. To indicate screen use, 
participants simply pick from a list of five 
standard time ranges, giving separate answers 
for each media category and for weekdays and 
weekends. (The first report about media use 
from this study, published last year, showed 
a small or no relationship between media 
exposure and brain characteristics or cogni- 
tive performance in computer-based tasks’.) 


Digital life 

Instead, researchers need to observe in 
exquisite detail all the media that people 
engage with, the platforms they use and the 
content they see and create. How do they 


switch between platforms and between 
content within those? How do the moments 
of engagement with various types of media 
interact and evolve? In other words, academics 
need a multidimensional map of digital life. 
To illustrate, people tend to use their 
laptops and smartphones in bursts of, onaver- 
age, 10-20 seconds”. Metrics that quantify the 
transitions people make between media seg- 
ments within a session, and between media and 
the rest of life, would provide more temporally 


“Intoday’s complex media 
environment, survey 
questions about the past 
month or even the past day 
might be almost useless.” 
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refined representations of actual use patterns. 
Asession begins when the screen lights up and 
ends when it goes dark, and might last less than 
asecond if it entails checking the time. Or it 
could start with a person responding to their 
friend’s post on Facebook, and end an hour 
later when they click onalink to read anarticle 
about politics. 

Measures of media use must also take 
account of the scattering of content. Today’s 
devices allow digital content that used to be 
experienced as a whole (suchas a film, news 
story or personal conversation) to be atom- 
ized, and the pieces viewed across several 
sessions, hours or days. We need measures 
that separate media use into content cate- 
gories (political news, relationships, health 
information, work productivity and so on) — 
or, even better, weave dissimilar content into 
sequences that might not make sense to others 
but are meaningful for the user. 

To try to capture more of the complexity, 
some researchers have begun to use logging 
software. This was developed predominantly 
to provide marketers with information on what 
websites people are viewing, where people are 
located, or the time they spend using various 
applications. Although these datacan provide 
more-detailed and -accurate pictures than 
self-reports of total screen time, they don’t 
reveal exactly what people are seeing and 
doing at any given moment. 


A better way 


To record the moment-by-moment changes 
ona person’s screen?™, we have built a plat- 
form called Screenomics. The software 
records, encrypts and transmits screenshots 
automatically and unobtrusively every 5 sec- 
onds, whenever a device is turned on (see 
go.nature.com/2fsy2j2). When it is deployed 
on several devices at once, the screenshots 
from each one are synced in time. 

This approach differs from other attempts 
to track human-computer interactions — for 
instance, through the use of smartwatches and 
fitness trackers, or diaries. Itis more accurate, 
it follows use across platforms, and it samples 
more frequently. In fact, we are working on 
software that makes recordings every second. 

We have now collected more than 30 mil- 
lion screenshots — what we call ‘screenomes’ 
— from more than 600 people. Even just two 
of these reveal what can be learnt froma fine- 
grained look at media use (see ‘Under the 
microscope’ and All in the details’). 

This higher-resolution insight into media 
use could help answer long-held questions 
and lead to new ones. It might turn out, for 
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Comment 


UNDER THEMICROSCOPE 


Recordings of smartphone use by two 
14-year-olds living in the same northern 
California community reveal what can be 
learnt from a fine-grained analysis of media 
use (see ‘All in the details’). 


Dose. A typical question that researchers 
might ask is whether study participants 

are ‘heavy’ or ‘light’ phone users. Both 
adolescents might have characterized their 
phone use as ‘substantial’ had they been 
asked the usual survey questions. Both 
might have reported that they used their 
smartphones ‘every day’ for ‘2 or more hours’ 
each day, and that looking at their phones 
was the first thing they did each morning and 
the last thing they did every night. 

But detailed recordings of their actual 
phone use over 3 weeks in 2018 highlight 
dramatic differences’. For participant A, 
median use over the 3 weeks was 3.67 hours 
per day. For participant B, it was 4.68 hours, 
an hour (27.5%) more. 


Pattern. The distribution of time spent 
using phones during the day differed even 
more. On average, participant A’s time was 
spread over 186 sessions each day (with a 
session defined as the interval between the 
screen lighting up and going dark again). 
For A, sessions lasted 1.19 minutes on 
average. By contrast, participant B’s time was 
spread over 26 daily sessions that lasted, 
on average, 2.54 minutes. So one of the 
adolescents turned their phone on and off 
seven times more than the other, using it in 
bursts that were about one-third the length 
of the other's sessions. 

These patterns could signal important 
psychological differences. Participant A’s 
days were more fragmented, maybe 


instance, that levels of well-being are related 
to how fragmented people’s use of mediais, 
or the content that they engage with. Differ- 
ences in brain structure might be related to 
how quickly people move through cycles of 
production and consumption of content. 
Differences in performance in cognitive tasks 
might be related to how much of a person’s 
multitasking involves switching between con- 
tent (say, from politics to health) and applica- 
tions (social media to games), and how long 
they spend on each task before switching. 


The Human Screenome Project 


So, how can we do better? What’s needed 
is a collective effort to record and analyse 
everything people see and do ontheir screens, 
the order in which that seeing and doing 
occurs, and the associated metadata that are 
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indicating issues with attentional control, 
or perhaps reflecting an ability to process 
information faster. 


Interactivity. Both adolescents spent time 
creating content as well as consuming 

it. They wrote text messages, recorded 
photos and videos, entered search terms 
and so on. On a questionnaire, both might 
have reported that they posted original 
material ‘sometimes’ or maybe ‘often’. But 
the screenshot data reflect patterns of 
interactivity that would be almost impossible 
for them to recall accurately. 

Participant A spent 2.6% of their screen 
time in production mode, creating content 
evenly throughout the day and usually within 
social-media apps. By contrast, participant B 
spent 7% of their total screen time producing 
content (and produced 2.5 times more). 

But they did so mainly in the evening while 
watching videos. 


Content. During the 3 weeks, participant A 
engaged with 26 distinct applications. More 
than half of these (53.2%) were social-media 
apps (mostly Snapchat and Instagram). 
Participant B engaged with 30 distinct 
applications, mostly YouTube (50.9% of the 
total). 

Zooming deeper into specific screen 
content reveals even more. For participant B, 
on average, 37% of the screenshots for 
a single day included food — pictures of 
food from various websites, photos of B’s 
own food, videos of other people eating or 
cooking, and food shown in a game involving 
the running of a virtual restaurant. 

In a survey, both adolescents might have 
reported that they used ‘a lot’ of apps, and 
might have given the names of some of 
them. But the content of their media diets 
would be impossible to capture. B.R. et al. 


available from the software and sensors built 
into digital devices (for instance, on time of 
day, location, even keystroke velocity). 

In any one screenome, screenshots are the 
fundamental unit of media use. But the par- 
ticular pieces or features of the screenome 
that will be most valuable will depend on the 
question posed — as is true for other ‘omes’. 
Ifthe concern is possible addiction to mobile 
devices, then arousal responses (detected by 
a change in heart rate, say) associated with 
the first screen experienced during a session 
might be important to measure. If the concern 
is the extent to which social relationships 
dictate how political news is evaluated, then 
the screenshots that exist between ‘social’ 
and ‘political’ fragments in the screenome 
sequence might be the crucial data to analyse. 
(News items flagged by aclose friend might be 
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perceived as more trustworthy than the same 
news obtained independently, for example.) 

How can researchers get access to such 
high-resolution data? And how can they 
extract meaning from data sets comprising 
millions of screenshots? 

One option is for investigators to collab- 
orate with the companies that own the data, 
and that have already developed sophisti- 
cated ways to monitor people’s digital lives, 
at least in certain domains, such as Google, 
Facebook, Amazon, Apple and Microsoft. The 
Social Science One programme, established 
in 2018 at Harvard University in Cambridge, 
Massachusetts, involves academics part- 
nering with companies for exactly this pur- 
pose”. Researchers can request to use certain 
anonymized Facebook data to study social 
media and democracy, for example. 

Largely because of fears about data leaks 
or study findings that might adversely affect 
business, such collaborations can require 
compromises in how research questions are 
defined and which data are made available, 
and involve lengthy and legally cumbersome 
administration. And ultimately, there is 
nothing to compel companies to share data 
relevant to academic research. 

To explore more freely, academics need to 
collect the data themselves. The same is true if 
they are to tackle questions that need answers 
within days — say, to better understand the 
effects of a terrorist attack, political scandal 
or financial catastrophe. 

Thankfully, Screenomics and similar plat- 
forms are making this possible. 

In our experience, people are willing to 
share their data with academics. The harder 
problem is that collecting screenomics data 
rightly raises concerns about privacy and sur- 
veillance. Through measures such as encryp- 
tion, secure storage and de-identification, it 
is possible to collect screenomes with due 
attention to personal privacy. (All our project 
proposals are vetted by university institutional 
review boards, charged with protecting human 
participants.) Certainly, social scientists can 
learna lot from best practice in the protection 
and sharing of electronic medical records” 
and genomic data. 

Screenomics data should be sifted using a 
gamut of approaches — from deep-dive qual- 
itative analyses to algorithms that mine and 
classify patterns and structures. Given how 
quickly people’s screens change, studies 
should focus on the variation in an individ- 
ual’s use of media over time as much as on 
differences between individuals and groups. 
Ultimately, researchers will be able to inves- 
tigate moment-by-moment influences on 
physiological and psychological states, the 
sociological dynamics of interpersonal and 
group relations over days and weeks, and even 
cultural and historical changes that accrue 
over months and years. 


ALL IN THE DETAILS 


Recordings of screenshots every five seconds reveal substantial differences in how 
two adolescents use their smartphones over 21 days (see ‘Under the microscope’). 
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Participant A’s time was spread over 186 sessions per day (with a session defined as the interval 
between the screen lighting up and going dark again). Each session lasted 1.19 minutes on average. 
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Zooming in on 2 hours of participant A’s activity on 
day 16 reveals more about how they spent their time. 


More than half of the apps that A engaged with were 
types of social media (mostly Snapchat and Instagram). 


Participant B 


Participant B’s time was spread over 26 sessions per day, lasting 2.54 minutes on average. 
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Participant B engaged with 30 distinct 


applications, mostly YouTube. 
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Some might argue that screenomics data 
are so fine-grained that they invite researchers 
to focus on the minutiae rather than the big 
picture. We would counter that today’s digi- 
tal technology is all about diffused shards of 
experience. Also, through the approach we 
propose, it is possible to zoom in and out, 
to investigate how the smallest pieces of the 
screenome relate to the whole. Others might 
argue that even with this better microscope, 
we will not find anything significant. But if 
relationships between the use of media and 
people’s thoughts, feelings and behaviours 
continue to be weak or non-existent, at least we 
could have greater confidence as to whether 
current concerns are overblown. 

The approachwe propose is complex, but no 
moresothan the assessment of genetic predic- 
tors of mental and physical states and behav- 
iours. Many years and billions of US dollars 
have been invested in other ‘omics’ projects. In 
genomics, asinneuroscience, planetary science 
and particle physics, governments and private 
funders have stepped up to help researchers 
gather the right data, and to ensure that those 
data are accessible to investigators globally. 
Now that so much of our lives play out on our 
screens, that strategy could prove just as valu- 
able for the study of media. 
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Readers respond 


Correspondence 


Rooibos settlement: 
acrucial omission 


We celebrate the compensation 
agreement between the rooibos- 
tea industry and South Africa’s 
Indigenous peoples (see Nature 
575, 258; 2019). As researchers 
inthe field, what concerns us are 
those left out of the story: small- 
scale farmers who have worked 
rooibos land for generations. 

It is their oral histories that 
informed how rooibos seeds 
were originally unearthed by 
following the paths of ants (a 
finding that led to the birth of 
the industry at scale), and how 
rooibos was used alongside 
breast milk to nurture their 
children. Yet these farmers 
do not fit neatly into the 
compensation-agreement 
narrative because most do not 
self-identify as Indigenous San 
or Khoi. Traditional knowledge 
does not necessarily havea 
clear-cut ethnic provenance. 

Although some small-scale 
rooibos farmers are descended 
from San and Khoi, many trace 
back to slaves and labourers 
brought in from other parts of 
Africa and from southeast Asia. 
The group was largely left out of 
the compensation negotiations 
and was eventually included 
only through a gesture by the 
National Khoisan Council. 
Whether the group will benefit 
in practice remains to be seen. 

If not, these communities will 
be further marginalized: by their 
exclusion from an Indigenous 
heritage, by their dearth of land 
and resources, and because they 
lack the power of a government- 
recognized council. 


Sarah Ives City College of San 
Francisco, California, USA. 
sarah.f.ives@gmail.com 


Rachel Wynberg University of 
Cape Town, South Africa. 


Graham Dutfield University of 
Leeds, UK. 


Sahel crisis: cut 
global emissions 


The recommendations by Alisha 
Graves and her colleagues for 
averting catastrophe in Africa’s 
Sahel overlook the need to 
address the drivers of its climate 
and ecological crises (Nature 
575, 282-286; 2019). 

Although the authors’ 
recommendations could help to 
prevent catastrophe in the short 
to medium term, immediate 
action is also needed against 
the fundamental causes if we 
are to mitigate crises of rapidly 
escalating scale and severity in 
the longer term. 

This might seem obvious, 
but it must be emphasized 
repeatedly if we are to ramp 
up our currently woefully 
inadequate action against the 
global climate and ecological 
emergency. Otherwise, there 
is animplicit acceptance of 
the status quo and the current 
trajectory. And the catastrophe 
inthe Sahel, and others like it 
around the world, will not be 
averted, but at best delayed. 


Keir Philip Imperial College 
London, UK. 
k.philip@imperial.ac.uk 
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Kurdistan: stop the 
cycle of strife 


lagree with some of the 
scientists who wrote to Nature 
from nations experiencing civil 
unrest: instability can wreck 
educational and research 
infrastructure (Nature 576, 
382-384; 2019). Several civil 
wars over the past half-century 
have certainly done soin 
Kurdistan — the geographical 
region divided between Iran, 
Iraq, Turkey and Syria. The 
continuing conflict against the 
Islamist terrorist group ISIS and 
Turkey is making matters worse. 
And civil war now looms in 
Rojava, the autonomous Kurdish 
region in northern Syria. 

The Kurds have been 
considered second-class 
citizens in their homelands for 
centuries. Other than in Iraqi 
Kurdistan, they have no right 
to speak in their own language 
or to wear traditional attire in 
state institutions. Research 
students must undergo security 
screening and are excluded 
from some subjects, including 
electronics and aerospace 
engineering. 

It pains me to think what 
another generation might have 
to live through in the region. 
Iwasa child inthe 1980s, 
when Iraq’s President Saddam 
Hussein’s use of chemical 
weapons killed more than 
3,000 Kurdish civilians in one 
day. He bombarded my city, 
Saqqezin Iran, forcing my family 
and thousands of others to flee. 
After the war, the damaged 
infrastructure of Iraq’s and 
Iran’s Kurdish regions — often 
there was electricity forjusta 
few hours each day — prevented 
most research laboratories from 
functioning. 


Ebrahim Karimi University of 
Ottawa, Canada. 
ekarimi@uottawa.ca 
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Futures: lost 
to the past 


lam deeply disappointed to 
see the loss of the Futures 
section from the print version 
of Nature. | read only the hard 
copy of the magazine, and 1 
looked forward every week to 
the science fiction on its last 
page. To be able to think deeply 
about contemporary problems 
and then abruptly shift toa more 
creative turn of mind was, to 
my knowledge, unique among 
scientific publications. 

The Futures section offered 
mean opportunity to make 
connections. For example, the 
mind- and time-bending short 
story ‘You will remember this’ 
by Justen Russell brought to life 
an illustration of two universes 
governed by different laws 
(Nature 574, 144; 2019). For me, 
this imaginative story found a 
route to reality as I read Philip 
Ball’s Comment article ‘Science 
must move with the times’, 
which speculated on the future 
scientific advances of humanity 
(Nature 575, 29-31; 2019). 

My more playful side will 
miss such indulgences when I 
am reading Nature to further 
my personal and professional 
development. 


Joseph Moore, Wilmington, 
Delaware, USA. 
jdmoorephd@gmail.com 
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News & views 


Malaria 


Mosquitoes get alegup 


on insecticides 


Flaminia Catteruccia 


Achemosensory protein enriched in the legs of malaria- 
carrying mosquitoes gives them resistance to insecticides 
used to treat bed nets. This discovery points to the challenges 


of tackling malaria. See p.376 


The wide distribution of insecticide-treated 
mosquito nets across malaria-endemic regions 
has drastically reduced the incidence of the 
disease over the past 20 years, and so has saved 
millions of lives’. However, malaria-carrying 
mosquitoes have now developed strong 
resistance to the pyrethroid chemicals used 
in these long-lasting insecticidal nets (LLINs). 
An understanding of the mechanisms under- 
lying resistance should reveal ways to make 
mosquitoes susceptible to insecticides once 
more. On page 376, Ingham etal.’ demonstrate 
an unexpected way in which mosquitoes 
in Africa neutralize pyrethroids: they use a 
class of small proteins normally involved in 
chemical communication. 

Malaria parasites are transmitted to humans 
by female mosquitoes of the genus Anopheles, 
with Anopheles gambiae being a major carrier 
of the disease. The first identified mechanism 
of pyrethroid resistance in wild Anopheles pop- 
ulations was a phenomenon called knockdown 
resistance, which involves mutations ina volt- 
age-gated sodium channel protein that reduce 
neuronal sensitivity to the insecticide*. Other 
mechanisms have also been identified, includ- 
ing enhanced metabolic activity of detoxifying 
enzymes such as cytochrome P450s (CYPs), 
which bind to and promote the breakdown of 
insecticides? (Fig. 1). 

The emergence of strong pyrethroid 
resistance in West African populations of 
A. gambiae’ prompted Ingham etal. to search 
for more mediators of resistance. The authors 
analysed the gene-expression profiles of insec- 
ticide-resistant A. gambiae populations from 
Burkina Faso and Cote d'Ivoire. To the authors’ 
surprise, they discovered higher-than-normal 
expression of genes that encode a family of 
chemosensory proteins, called sensory 
appendage proteins (SAPs). 


SAPs, like all chemosensory proteins, 
are found only in insects. They are small, 
soluble proteins that typically transmit 
chemical signals by transporting small 
hydrophobic molecules between cells. Ing- 
ham and colleagues found that reducing 
the levels of one of these proteins, SAP2, in 
pyrethroid-resistant A. gambiae significantly 
restored the mosquitoes’ susceptibility to pyre- 
throids. Conversely, overexpressing SAP2 in 


— Bed net 


©—— Pyrethroids 


an otherwise susceptible A. gambiae colony 
boosted the mosquitoes’ resistance levels. 

How does achemosensory protein interfere 
with insecticide activity? Ingham et al. show 
that SAP2 binds to pyrethroids with high speci- 
ficity, and that its expression is enhanced inthe 
legs of mosquitoes. These data suggest that 
SAPs sequester pyrethroids that penetrate the 
mosquito’s hard exterior when it lands onabed 
net, perhaps preventing the insecticide from 
exerting its toxic effect onthe nervous system 
by promoting its breakdown. 

Finally, Ingham etal. analysed the genomes 
of West African Anopheles populations col- 
lected over time, making use of an existing 
database as well as sequences that they had 
gathered. They found that a ‘selective sweep’ 
had occurred near the genomic region that 
encodes SAP2 — a phenomenon in which 
one particular version of a genomic region 
becomes more prevalent in a population 
as a result of natural selection. The authors 
showed that the sweep had occurred over the 
time in which pyrethroid resistance sharply 
increased, maybe owing to the beneficial 
effects of one version of this genomic region 
on survival. Taken together, Ingham and col- 
leagues’ data unveil chemosensory proteins 


Target-protein resistance 


Metabolic resistance 


CYP 


Cuticular resistance 


tt 


Skin — 


Chemosensory protein sequestration 
“2 a = 
= 
G 


Figure 1| Multiple lines of resistance. Malaria-carrying mosquitoes in West Africa have developed several 
mechanisms of resistance to insecticides called pyrethroids, which are used to treat bed nets”. Mutations in 
a voltage-gated sodium channel protein to which pyrethroids bind reduce the chemicals’ ability to recognize 
their target. Enhanced activity of metabolic enzymes such as the cytochrome P450s (CYPs) can break down 
the pyrethroids before they kill the insect. Thickening of the mosquito’s outer cuticular layer can slow 
insecticide penetration into the skin, thereby reducing the chemicals’ concentration. Ingham et al.? now 
show that, in the insect’s legs, the chemosensory protein SAP2 binds to pyrethroids, sequestering them and 


so preventing them from functioning. 
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as akey component of insecticide resistance 
in Anopheles mosquitoes. 

Chemosensory proteins represent a 
previously unknown class of resistance-causing 
factor, and so Ingham and co-workers’ find- 
ing points to a fresh opportunity to restore 
full susceptibility to pyrethroids in West 
African Anopheles populations. Resistance 
caused by CYP enzymes has been mitigated 
by adding compounds that interfere with 
CYPs to bed nets’; similarly, compounds that 
inhibit binding between SAP2 and insecticides 
could be generated and incorporated into 
next-generation LLINs. Moreover, the genomic 
region associated with SAP2 resistance can now 
be used asa molecular marker for tracking the 
spread of this resistance mechanism. In the 
future, it will be crucial to determine whether 
and howchemosensory proteins interact both 
functionally and spatially with other mecha- 
nisms of resistance, to inform the optimal 
design of resistance-management strategies. 

Although Ingham and co-workers’ 
study provides hope of reversing mosquitoes’ 
resistance to insecticides, it also highlights 
how skilled these insects are at escaping 
unwanted attention. Clearly, our understand- 
ing of insecticide resistance is far from com- 
plete, and we should expect other studies in 
different locations to identify yet more such 
mechanisms, acting at local or continental 
levels. Anopheles species have been populat- 
ing Africa for more than 100 million years — 
considerably longer than humans and our 
ancestors®. Such an enduring connection 
with their natural habitat isa testament to the 
challenge that we face when targeting these 
insects. 

The next generation of LLINs and indoor 
residual sprays (another method of delivering 
insecticides) is currently being deployed in 
Africa’. Simultaneously, new insecticide-based 
methods such as insect-attractive targeted 
sugar baits are being tested”°. But, like previ- 
ous interventions, these tools will probably 
undergo cycles of impactful but relatively 
short-lived success, followed by decreased 
effectiveness owing to the emergence of 
resistance. Beyond insecticides, other mosqui- 
to-control strategies will probably encounter 
similar resistance issues. These include meth- 
ods that rely on mosquito killing, such as 
mosquitocidal drugs”, and genetic systems 
designed to suppress Anopheles populations 
(reviewed in ref. 12). 

It is possible that the combined use of 
multiple strategies will break mosquito 
endurance and lead to population collapse. 
However, work from my group recently 
showed that strong selective pressures 
imposed on Anopheles females could actually 
favour malaria transmission, for instance by 
triggering acceleration of parasite growth 
rates”. To avoid this issue, mosquito-targeting 
interventions could be integrated with 
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approaches that block parasite development 
in the insects without causing them harm, 
thus reducing selective pressures. Further- 
more, mathematical models suggest that 
our chances of achieving sustainable malaria 
control could be improved by incorporating 
antimalarials into LLINs or indoor residual 
sprays, to kill parasites and prevent their 
transmission even when mosquitoes become 
resistant to insecticides". Similar results could 
be obtained by delivering antiparasitic agents 
through biological or genetic means”. 

Whatever the eventual solution, the road to 
malaria elimination remains long. Mosquitoes 
are sending clear signals that they will fight for 
their survival. 
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Evolution of circuits 
for machine learning 


Cyrus F. Hirjibehedin 


The fundamental machine-learning task of classification 
can be difficult to achieve directly in ordinary computing 
hardware. Unconventional silicon-based electrical circuits 
can be evolved to accomplish this task. See p.341 


Artificial intelligence (Al) has allowed 
computers to solve problems that were previ- 
ously thought to be beyond their capabilities, 
from defeating the best human opponents in 
complex games! to automating the identifi- 
cation of diseases”. There is therefore great 
interest in developing specialized circuits that 
can complete Al calculations faster and with 
lower energy consumption than can current 
devices. On page 341, Chen etal.? demonstrate 
anunconventional electrical circuit in silicon 
that can be evolved in situ to carry out basic 
machine-learning operations. 

Although computers excel at performing 
calculations that have well-defined answers, 
they have not been good at making guesses. 
For example, if you are thinking about selling 
your car, a computer is ideally suited for cal- 
culating the average price that similar cars 
have sold for, to help you determine your 
selling price. But by analysing the enormous 
digital data sets that are currently available, 
Al techniques such as machine learning can 
now teach computers to make sensible pre- 
dictions. One of the most basic operations that 
machine-learning algorithms can carry out 
when provided witha large set of inputs (such 
as the age of acar and how many kilometres it 
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has been driven) is classification into one of 
aset of categories, such as whether the car is 
in poor, fair or good condition and therefore 
whether you can expect to get the price you 
want for it. 

Using the structure of the human brain as 
inspiration, scientists and engineers have 
made substantial progress in developing 
specialized hardware to greatly reduce the 
amount of time and energy needed to per- 
form tasks such as classification*. There are 
also many unconventional device concepts 
for machine learning that are still in the early 
stages of development but that could offer rad- 
ical new capabilities. For example, researchers 
are exploring whether superconductor-based 
electrical circuits that work at only a few 
degrees above absolute zero, and that oper- 
ate at gigahertz frequencies with high energy 
efficiency, could enable machine-learning 
applications that are currently infeasible using 
conventional approaches’. 

Chen and co-workers’ circuit is also inspired 
by the brain, and represents a major depar- 
ture from typical electrical circuits. Normally, 
electrical current flows through circuits like 
water flowing ina river. If the river becomes 
so shallow that it is reduced to a set of small 


Charge 
puddle 


a Background 
material 


Control 
wire 


Output 
wire 


Hopping 


Figure 1| An unconventional circuit for machine learning. a, Chen et al.? demonstrate an electrical circuit 
in which charge hops between ‘puddles’ of charge in a background material. The operation of the circuit is 
tuned by applying voltages to control wires. Inputs to the circuit are provided by voltages on input wires, 

and the circuit’s output is determined by whether or not charge flows through an output wire. b, The control 
wires modify the regions of the circuit in which charge hopping can occur, thereby modifying which hopping 
paths can exist. An example of the effect produced by changing the voltage for a single control wire is shown 
here. The authors use their circuit to carry out basic machine-learning operations. 


puddles, the water can no longer flow because 
the puddles are isolated from each other by 
barriers formed by the riverbed. 

Although the barriers between water pud- 
dles are too wide for a water molecule to hop 
from one puddle to the next, this is not the 
case for electrical charge in ‘puddles’ of charge 
separated by nanoscale distances. Tools such 
as scanning tunnelling microscopes use the 
high sensitivity of a hopping process called 
quantum-mechanical tunnelling to routinely 
image features as small as individual atoms on 
surfaces*. This tunnelling is also at the heart of 
quantum-computing technologies that have 
made impressive advances in the past decade’. 

Previous work by some of the current 
authors’ produced isolated charge puddles 
froma collection of gold nanoparticles that 
were randomly deposited ona silicon surface, 
with insulating molecules between them. 
These puddles were used to implement cir- 
cuits that carried out conventional calcula- 
tions, rather than machine learning, and are 
at the heart of Chen and colleagues’ circuit 
design (Fig. 1a). Informationis input into such 
a circuit through electrical voltages that are 
applied using ordinary wires. The electric 
fields from these wires can alter whether or not 
hopping between neighbouring puddles can 
occur, and therefore can modify the hopping 
path for electrical charge through the circuit 
(Fig. 1b). The output of the circuit is deter- 
mined by whether or not electrical current 
flows through another designated wire. 

Because the distribution of charge puddles 
israndom, it would seem impossible to predict 
how sucha circuit would behave. However, the 
high sensitivity of tunnelling makes it possible 
to strongly modify the behaviour of the circuit 
using different control wires. This behaviour 
also cannot be easily predicted, but the previ- 
ous work showed that configurations could be 
reliably found that carried out logic operations 
for two inputs, such as indicating whether at 


least one or both of the inputs were on (called 
an OR gate and an AND gate, respectively, in 
binary logic). 

Although these earlier circuits did not 
perform machine-learning operations, the 
researchers did use a machine-learning algo- 
rithm to determine the control parameters 
needed to make the circuits carry out different 
operations. This algorithm is inspired by bio- 
logical evolution, starting froma set of random 
control parameters and using only the most 
promising outcomes to ‘breed’ new parameter 
sets for successive generations. The previous 
study was groundbreaking because it showed 
that a single circuit could be reprogrammed 
insituto execute any two-input logic operation 
by simply changing the voltages applied to five 
control wires. 

In the present work, Chen et al. greatly 
expanded this basic idea by overcoming some 
of its key limitations and using the circuits to 
perform Al operations. First, the authors found 
a way to produce charge puddles directly in 
silicon by randomly implanting atoms that 
donate small amounts of charge to the silicon 
itself. This method makes the devices more 
broadly manufacturable than they previously 
were and potentially compatible with the cur- 
rent generation of electronics, which are also 
mainly based on silicon. Second, in this new 
material system, the maximum temperatures 
at which hopping dominates the electrical 
flow in these circuits, and therefore at which 
the desired operation is viable, is increased 
from barely above absolute zero to room 
temperature. 

To demonstrate the increased potential of 
these devices, Chen etal. evolved circuits that 
could classify all 16 possible sets of 4 binary 
inputs (OOOO, 0001, ..., 1111, where O repre- 
sents no input on a given wire and 1 represents 
an input ona given wire). This classification 
was possible even when the number of control 
voltages was reduced from five to three. 
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The authors then incorporated this in silico 
4-input classifier into the more complex Al 
task of classifying a standard set of black and 
white images of handwritten digits, which 
were encoded as a 28 x 28 array of pixels, 
each with a value of either O (white) or 1 
(black) (see Fig. 4 of the paper’). To do this, 
Chen et al. subdivided the original array into 
sets of 2 x 2 neighbouring pixels and fed the 
value of each of these 4 pixels into the 4-input 
classifier’s input wires. The authors then set 
the control wires to perform classification for 
each of the 16 possible sets of 4 inputs, and 
passed all 16 outputs for each set of 2 x 2 pixels 
toa machine-learning algorithm run on con- 
ventional hardware that identified the digit 
inthe full image. 

Chen and colleagues’ hardware platform 
for classification is inherently scalable, and 
individual classifier devices can be run in 
parallel without any conflicts. In the current 
incarnation of the platform, the set-up used 
to perform the measurements limits the 
speed at which the classifiers can be oper- 
ated and, in turn, the energy efficiency. The 
authors suggest alternative ways in which 
the measurements could be implemented to 
greatly improve the speed and energy effi- 
ciency of the circuits; demonstrating such 
improvement will be crucial if these designs 
are to move from the research laboratory to 
real-world applications. 

Because the random distribution of charge 
puddles in both the previous and current 
designs is difficult to model, the circuits’ high 
sensitivity to the control voltages is essential 
for evolving a set of control parameters after 
fabrication. Although the absence of the need 
to precisely position the puddles makes the 
circuits easier to fabricate, their performance 
might be further enhanced by having pre- 
defined, atomically precise arrangements 
of the individual impurity atoms that donate 
charge to the silicon’. Such enhancements 
could include reproducibility of control 
parameters for different devices, improved 
reliability of operation at higher temperatures 
and reduced energy consumption. 
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News & views 


Alzheimer’s disease 


T cells make ahome 
in the degenerating brain 


Michael T. Heneka 


A subpopulation of adaptive immune cells patrols the brain and 
cerebrospinal fluid in people who have Alzheimer’s disease. 
This discovery should broaden our understanding of how the 
immune system can influence neurodegeneration. See p.399 


For decades, research into Alzheimer’s 
disease has centred on neurons. Only in the 
past few years have scientists identified a role 
for immune cells in the progression of this 
neurodegenerative disorder’. Most research 
has focused on the nonspecific, innate 
branch of the immune system. But Gate et al.” 
report on page 399 that an immune-cell sub- 
population belonging to the adaptive immune 
system — which remembers and responds to 
specific foreign invaders — might also have a 
role in Alzheimer’s disease. 

The authors isolated and analysed immune 
cells from the blood of healthy people and 
people who had Alzheimer’s disease or a 
precursor of the disease known as mild 
cognitive impairment (MCI). They discov- 
ered an immune-cell subpopulation called 
CD8* T effector memory CD45RA* (Tena) 
cells that was associated with MCI and 
Alzheimer’s disease. Teypa cells have previ- 
ously been linked to immunological memory, 
and they release inflammatory and cytotoxic 
(cell-death-promoting) molecules’. 

Analysis of a separate cohort of people 
who had Alzheimer’s disease revealed that an 
increased presence of Ty yp, Cells in the blood 
was associated with compromised cognitive 
performance. This finding could indicate that 
Temra Cells contribute to neuronal dysfunction 
by secreting inflammatory and cytotoxic 
molecules in the brain (Fig. 1). Alternatively, 
a damaging mechanism that causes cognitive 
dysfunction might also elicit an inflammatory 
Trmra-Ccell response in the blood. 

Gate et al. corroborated their findings 
in vitro, showing that stimulation with an 
inflammatory molecule caused immune 
cells from people who had MCI or Alzheim- 
er’s disease to release more interferon-y (a key 
pro-inflammatory protein) than did immune 
cells from people who did not have these 
conditions. This is consistent with another 
study*, which demonstrated that T cells 
derived from people who have Alzheimer’s dis- 
ease become more active than do those from 
healthy people when exposed to B-amyloid,a 
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protein associated with this disorder. 

The authors then asked whether the 
presence of Tey, cells could be used to predict 
disease status. Indeed, a machine-learning 
algorithm could use measurements of 


“This is perhaps the first 
evidence that clonally 
expanded T cells invade the 
cerebrospinal fluid in age- 
related neurodegenerative 
diseases.” 


Temra Cells (together with information about 
other immune-cell populations) to distinguish 
between healthy people and those with MCl or 
Alzheimer’s disease with about 80% accuracy. 
Many immune processes alter during ageing 
and soare of limited use for predictive clinical 
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testing, but age did not influence the level 
Of Tempra Cells. This type of technique, once 
refined, might therefore be used alongside 
biomarkers of neuronal damage and degen- 
eration for blood-based diagnostic tests, 
improving our ability to detect Alzheimer’s 
disease at an early stage. 

Next, Gate and colleagues analysed the 
brains of people who had died with Alzheimer’s 
disease. This revealed CD8* T cells (which 
might be Tyra Cells) in the perivascular space 
around the brain’s blood vessels, and at sites of 
B-amyloid deposition, as previously reported 
for T cellsin Alzheimer’s disease**. CD8* T cells 
are known to physically contact and sever neu- 
ronal processes, causing structures called 
neuritic spheroids to form nearby — another 
hallmark of Alzheimer’s disease’. Thus, it is 
conceivable that T,yjp, cells contribute to neu- 
ronal damage not only by secreting immune 
molecules, but also by directly damaging 
neuronal processes’. 

MCland Alzheimer’s disease are associated 
with changes in the number and proportion 
of T cells in the cerebrospinal fluid (CSF) 
that surrounds the brain and spinal cord®”. 
The investigators therefore asked whether 
Temra Cells were found in the CSF and whether 
there was evidence of ‘clonal expansion’ of this 
cellular subpopulation. Naive T cells each have 
different T cell receptor (TCR) proteins, but 
when the receptor is stimulated by a particle 
called an antigen, the cell proliferates to form 
clones of itself. The presence of more than one 
cell with the same TCR therefore indicates 
clonal expansion — a sign that T cells have been 
activated previously. The authors sequenced 
TCRs from an independent cohort of people 
and identified several T-cell clones, including 
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Figure 1 | T,yp, cells and age-related neurodegeneration. Gate et al.” report that the presence of immune 
cells called CD8* T effector memory CD45RA* (Te ya) cells in the brain is associated with Alzheimer’s disease. 
a, Evidence froma few people suggests that the cells are activated by binding between the T-cell receptor 
(TCR) and an antigen molecule (which can be froma host cell or a foreign invader). The cells then proliferate 
to produce an expanded pool of Tgypa-cell clones. The cells patrol the cerebrospinal fluid (CSF). D, Tryp, Cells 
might promote neuronal damage indirectly, by releasing inflammatory and cytotoxic (death-promoting) 
molecules, or directly, by physically interacting with and severing neuronal processes, causing the 
formation of structures called spheroids that are associated with Alzheimer’s disease. Alternatively, they 


might have no role in disease progression (not shown). 
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Temra Cells, in people with Alzheimer’s disease. 
This is perhaps the first evidence that clonally 
expanded T cells invade the CSF in age-related 
neurodegenerative diseases. 

Gate et al. then validated their result using 
gene-expression analysis. This revealed that the 
Temra-cell population was the predominantly 
expanded T-cell clonein each person who had 
Alzheimer’s disease. The population expressed 
various cytotoxic genes, and was enriched in 
the hippocampus — a brain region crucial for 
human memory. In line with this observation, 
hippocampal T-cell infiltration promotes cog- 
nitive decline ina mouse model of Alzheimer’s 
disease’®. The authors also found evidence for 
Temra-Ccell clones and gene-expression changes 
in the CSF of people who had another neuro- 
degenerative disorder, Parkinson’s disease, 
highlighting the possibility that different 
age-related neurodegenerative diseases share 
similar molecular underpinnings. 

Which antigens drive clonal expansion of 
Tempra Cells? By comparing TCR sequences from 
people who had MCland Alzheimer’s disease, 
the investigators found evidence that clonally 
expanded Tera Cells had been bound by two 
antigens produced by a virus of the herpes 
family, Epstein-Barr virus (EBV). However, it 
is important to note that a role for EBV infec- 
tion in neurodegeneration has not yet been 
reported, and Gate et al. make no suggestion 
that EBV is involved in the development of 
Alzheimer’s disease. 

Gate and colleagues’ data involve only a 
few patients and should be interpreted with 
great care, particularly given that EBV infects 
about 95% of people during early life”. Previ- 
ous work” has shown a complex relationship 
between herpes viruses and Alzheimer’s dis- 
ease in mice. One the one hand, B-amyloid 
fibres can entrap herpes viruses, extending 
survival in mouse models of Alzheimer’s dis- 
ease. But on the other hand, virus infection 
strongly increases B-amyloid deposition in 
these animals. 

In addition, a study" of 85 people who had 
Alzheimer’s disease found evidence of EBV 
DNA in the brains of only 6% of cases. All of 
these people carried the gene APOF4, which 
is associated with a high risk of Alzheimer’s 
disease and could explain why they devel- 
oped the disorder. The same study did find 
that antibody responses against EBV increased 
during cognitive deterioration and pro- 
gression of Alzheimer’s disease”. However, 
these responses are quite common in older 
people. Moreover, a recent meta-analysis 
found no correlation between herpes-virus 
infection and dementia”. Longitudinal studies 
involving many more people will be needed 
before solid conclusions can be drawn. 

It will be interesting to reconcile Gate and 
colleagues’ data with the finding” that T cells 
can restrain cognitive deficits in mouse 
models of Alzheimer’s disease. Analysis of 


less-prominent T-cell clones in people with 
and without disease might reveal other, 
potentially harmful — or even protective — 
subclones. In addition, the current study will 
no doubt renew efforts to define the cross- 
talk between innate and adaptive immunity 
in general, as well as in neurodegeneration. 
Perhaps, in the future, these interactions could 
be harnessed for diagnostic purposes or to 
develop therapeutic interventions. 
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Platelets have ahold 
over immune cells 


Filip K. Swirski 


Plaques are lipid-rich structures in the blood-vessel wall that 
can cause heart attacks or strokes if they rupture. It now seems 
that blood-cell fragments called platelets alter the function of 
immune cells in ways that accelerate plaque formation. 


Heart attacks or strokes might seem to be 
sudden events, but they are the consequences 
of acondition called atherosclerosis, which 
can be decades in the making. Atherosclero- 
sis involves the accumulation of lipids and 
immune cells into structures called plaques in 
the blood-vessel wall. If these plaques become 
unstable they can rupture, blocking blood flow 
and so depriving tissues such as the heart and 
brain of oxygen, respectively triggering a 
heart attack or a stroke. Identifying precisely 
how plaques grow at cellular and molecular 
scales is therefore crucial for understanding 
and so treating atherosclerosis. Writing in 
Science Translational Medicine, Barrett et al.' 
enrich our thinking about how atherosclero- 
sis evolves, providing evidence that platelets 
in the blood promote the formation of big- 
ger, more dangerous plaques by shaping the 
function of immune cells. 

Monocytes area class of short-lived immune 
cell crucial to host defence. They survey their 
environment, patrolling the vasculature and 
frequently migrating in and out of the blood to 
scout for injuries or infections. This movement 
is aided by endothelial cells, which demarcate 
the border between blood and tissue, and 
which producea panoply of monocyte-attract- 
ing chemical messengers, enabling mono- 
cyte surveillance of and migration across 
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the blood-vessel wall. Platelets — blood-cell 
fragments best known for making blood clots 
— likewise help monocytes to infiltrate the ves- 
sel wall by adhering to the cells to form mono- 
cyte-platelet aggregates. Precisely how such 
aggregates promote migrationis not clear, but 
itis known that platelets can deliver a variety of 
mediators to which monocytes can respond’. 

Because of their role in monitoring the vas- 
culature, monocytes are key to the develop- 
ment of atherosclerosis. Voracious eaters, they 
ingest lipids that accrue in plaques, before 
morphing into larger, less agile macrophages. 
As this transformation occurs, the cells can 
wreak inflammatory havoc, contributing toa 
feed-forward loop that generates bigger, rup- 
ture-prone plaques*. But because monocytes 
are crucial for host defence, eliminating them 
entirely is not therapeutically viable. Identify- 
ing and blocking factors involved in monocyte 
recruitment to plaques might, however, bean 
alternative strategy. 

Barrett and colleagues investigated inter- 
actions between platelets, monocytes and their 
descendent macrophages in mice that have 
abnormally high levels of cholesterol — a risk 
factor for atherosclerosis. They observed that 
platelets adhere to monocytes in blood more 
readily when mice have high cholesterol levels, 
bolstering the idea that monocyte-platelet 
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Figure 1| How platelets might promote plaques. Atherosclerosis is a condition in which immune cells 
and lipids aggregate in structures called plaques in the blood-vessel wall. Barrett et al.’ provide evidence 
for amodel in which blood-cell fragments called platelets promote plaque build-up in mice that have high 
cholesterol levels. In this model, platelets adhere to immune cells called monocytes, and promote (through 
an unknown mechanism) both production of inflammatory signalling molecules such as interleukin-1B 
(IL-1B) and movement to plaques. In plaques, monocytes ingest lipids and transform into macrophage 
cells. Platelets promote expression of SOCS3, a protein that induces macrophages to adopt inflammatory 
characteristics. The cells then secrete high levels of IL-1B and other inflammatory factors, and have alow 
capacity to ingest dying cells through efferocytosis. Together, these factors promote plaque growth. 


aggregates augment monocyte recruitment 
to growing plaques. In parallel, the authors 
performed single-cell RNA sequencing of 
immune cells retrieved from plaques, and 
found an increase in platelet-specific factor 
Pf4 on macrophages, suggesting that plate- 
let adherence persisted beyond monocyte 
recruitment. Platelets, it seemed, were also 
aggregating with macrophages. 

This liaison spells trouble. The group used 
specific antibodies to deplete platelets ina 
genetically engineered strain of mouse sus- 
ceptible to atherosclerosis, and compared 
the macrophages of these animals with 
those of counterparts that had not received 
platelet-depleting antibodies. Single-cell 
RNA sequencing revealed that exposure to 
platelets triggers increased production and 
release of plaque-enhancing inflammatory 
molecules by macrophages. Interleukin-1 is 
one such mediator — and, indeed, therapeutic 
blockade of this protein in humans attenuates 
cardiovascular disease’. 

Next, Barrett etal. provided further evidence 
thatthe presence of platelets accelerates plaque 
growth. In addition to inducing inflammato- 
ry-molecule production, platelets impaired 
macrophages’ capacity to ingest dying cells 
through efferocytosis, increasing the num- 
ber of undigested dying cells in plaques — a 
phenomenon that increases the likelihood of 
plaque rupture. Thus, platelets promote ather- 
osclerosis by fostering monocyte recruitment 
to plaques and by reprogramming macrophage 
function (Fig. 1). 

The authors next investigated the fac- 
tors that govern the switch in macrophage 
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function. Twotranscription factors, suppressor 
of cytokine signalling 1(SOCS1) and SOCS3, are 
known to influence macrophage behaviour’. 
Specifically, a low ratio of SOCS1 to SOCS3 
triggers gene-expression patterns that lead 
to inflammatory characteristics, whereas a 
high ratio prompts tissue-repairing traits. 
The team found that macrophages taken 
from plaques in platelet-depleted mice had 
a higher SOCS1:SOCS3 ratio than did macro- 
phages from untreated animals, indicating 
that platelets somehow alter this pathway 
in macrophages to trigger inflammatory 
characteristics. 

Finally, Barrett et al. asked whether their 
findings might be applicable to humans. They 
found that, ina group of women, the platelet 
count — and expression of genes that encode 
SOCS3 and interleukin-1B — was higher in 
those who had hada heart attack than in those 
who had not had one. Moreover, the authors 
report an inverse relationship between the 
SOCS1:SOCS3 ratio and markers of platelet 
activation in people with peripheral-artery 
disease. Thus, this mechanism is potentially 
relevant to human disease. 

Barrett and colleagues’ results are intrigu- 
ing. Platelet blood-clotting ability serves 
an essential function in wound healing, but 
it can be detrimental in the wrong context. 
Blood-thinning, anti-clot drugs, such as 
clopidogrel or aspirin, have well-documented 
therapeutic effects in preventing blood clots. 
The current study suggests that blocking plate- 
lets might have collateral antiatherosclerotic 
benefits, which aligns with previous work®. 

Of course, many questions remain. For 
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instance, it is unclear precisely how platelets 
foster monocyte recruitment and how they 
reprogram macrophages. There are clues to 
be found in other work, given that platelets 
are a source of various immune mediators””. 
Barrett et al. suggest that platelets stimu- 
late macrophages by releasing the protein 
S100A9, which triggers the inflammatory 
TLR signalling pathway in macrophages, but 
this possibility requires further exploration. 
Another question is whether distinct types of 
platelet have evolved for specialized cell com- 
munication. In support of this idea, research® 
suggests that large platelet-producing cells 
called megakaryocytes, which reside in dif- 
ferent locations, have differing functions. 
Finally, it will be important to know whether 
all macrophages are equally affected by plate- 
let instruction, or whether the partnership is 
specific to certain stages of development, 
anatomical locations or times. 

The authors caution against drawing 
sweeping conclusions, and, indeed, there 
are caveats to the study that should be con- 
sidered. For instance, it would be useful to 
reduce platelet levels by approaches other 
than the anti-CD42b antibody used here. 
This antibody is expected to deplete platelets 
only transiently, and it might have collateral, 
unforeseen effects. In addition, it would be 
valuable to visualize the aggregates in vivo, 
perhaps using electron microscopy, to obtain 
aclear picture of what a macrophage- platelet 
aggregate really looks like. Finally, future work 
will need to determine whether this phenom- 
enon occurs broadly in other situations 
involving monocyte recruitment and conse- 
quent macrophage activity, for instance in 
infected or injured tissue. 

Nevertheless, the study builds on a long 
line of work implicating platelets, mono- 
cytes and macrophages as key contributors 
to atherosclerosis. The conceptual power of 
exploring how immune and blood-clotting 
pathways intersect, theinsights into monocyte 
and macrophage function, and the corrobo- 
rating human data, are all worthy of further 
exploration. 
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Transparent crystals with 
ultrahigh piezoelectricity 


Jurij Koruza 


It has been difficult to make transparent materials that have 
extremely high piezoelectricity — a useful property related 
to the coupling of electric fields and mechanical strain. This 
hurdle has now been overcome. See p.350 


Piezoelectric materials show high electro- 
mechanical coupling, which means that they 
can generate large strains if an electric field 
is applied to them, and can transform exter- 
nal mechanical stimuli into electric charge or 
voltage’. They are widely used in electronic 
applications, including sensors, small motors 
and actuators — devices that convert electri- 
cal energy into movement. In addition, their 
high energy efficiency and ease of miniatur- 
ization are driving the development of new 
technologies, such as energy harvesters for 
the growing network of Internet-connected 
devices knownas the Internet of Things, actu- 
ators for touch screens and microrobots. On 
page 350, Qiu etal.’ report the preparation of 
high-performance piezoelectrics that have 
the long-desired property of near-perfect 
transparency to light. This breakthrough 
could lead to devices that combine excel- 
lent piezoelectricity with tunable optical 
properties. 

Most high-performance piezoelectrics 
are ferroelectrics — materials that have a 
spontaneous electric polarization that can 
be reversed by the application of an external 
electric field. At the atomic level, ferroelectrics 
have a local polarization that is caused by the 
displacements of certainions from their sym- 
metric positions. Regions that havea uniform 
direction of polarization are referred to as 
ferroelectric domains and are separated by 
boundaries called domain walls. The materi- 
al’s crystal structure determines the possible 
directions of polarization and, in turn, the 
types of domain wall. For example, arhombo- 
hedral crystal structure enables 8 possible 
domain variants and 71°, 109° and 180° domain 
walls (where the angle refers to the difference 
in polarization direction between the domains 
separated by the wall). 

For these ferroelectrics to be used as piezo- 
electrics, they must first undergo a process 
known as poling, in which an external elec- 
tric field is applied to the material to reorient 
unfavourably oriented domains and induce 
macroscopic polarization. Poled ferroelectrics 


show a large electromechanical response to 
an external electric field or to mechanical 
force, and this response is typically charac- 
terized by a quantity dubbed the piezoelectric 
coefficient. The magnitude of this coefficient 
depends to a large extent on the domain 
configuration; the other major contribu- 
tion comes from the crystal lattice. Some of 
the largest coefficients known today were 
reported** for ferroelectric crystals based 
on lead magnesium niobate-lead titanate 
(PMN-PT). These crystals have piezoelectric 
coefficients above 1,500 picocoulombs per 
newton (pC N), which is about ten times 
higher than those of most other ferroelectrics. 

The poling process is conventionally carried 
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out using direct-current (d.c.) electric fields. 
In rhombohedral PMN-PT crystals that are 
[001] oriented (a particular crystallographic 
orientation), d.c. poling results inthe removal 
of 180° domain walls and the formation of a 
laminar (layered) domain structure consisting 
of 71° and 109° walls (Fig. 1a). When light prop- 
agates throughsuchastructure, the difference 
between the refractive indices at each side of 
an encountered 71° wall induces scattering, 
resulting in the poled crystal having an over- 
all opaque appearance. The 109° walls in this 
configuration, however, do not give rise to 
scattering. 

Qiu and colleagues exploited this differ- 
ence in light scattering between 71° and 
109° domain walls. The authors simulated the 
evolution of domains in [001]-oriented rhom- 
bohedral PMN-PT crystals that were subjected 
to either d.c. or alternating-current (a.c.) elec- 
tric fields. These simulations showed that the 
application of an a.c. poling field (a method 
reported only in the past few years*®), instead 
of ad.c. one, greatly reduces the number of 
71° domain walls (Fig. 1b). Qiu etal. attributed 
this effect to a process referred to as domain 
swinging, whereby the 71° walls alternate 
between two crystallographic planes and tend 
to merge, thereby decreasing their number. 
The 109° walls, by contrast, remain almost 
unaffected by the a.c.-poling process. 

The authors then compared their simulation 
results with d.c.- and a.c.-poled [001]-oriented 
rhombohedral PMN-PT crystals that had been 


b a.c.-poled 


Figure 1 | Near-perfect light transmittance in a high-performance piezoelectric. a, Materials known as 
ferroelectrics contain regions of uniform electric polarization called domains (the different shades represent 
different orientations of polarization). These domains are separated by boundaries dubbed domain 

walls. When [001]-oriented rhombohedral ferroelectric crystals of lead magnesium niobate-lead titanate 
(PMN-PT) undergo a process called poling using direct-current (d.c.) electric fields, they contain both 71° and 
109° domain walls (where the angle indicates the difference in polarization orientation between the domains 
separated by the wall). The 71° walls cause incident light to be scattered, such that the crystals are opaque. 

b, Qiuet al? demonstrate that, when the poling is carried out using alternating-current (a.c.) electric fields, 
the number of 71° domain walls is greatly reduced. The crystals show near-perfect light transmittance and 
ultrahigh piezoelectricity — a property associated with the coupling of electric fields and mechanical strain. 


© 2020 Springer Nature Limited. All rights reserved. 


Nature | Vol 577 | 16 January 2020 | 325 


News & views 


grown by a modified version of a widely used 
approach called the Bridgman method. They 
studied the domain structure of the crystals 
using three techniques (high-resolution X-ray 
diffraction, polarized-light microscopy and 
birefringence imaging spectroscopy), and 
confirmed the removal of 71° domain walls 
throughout the a.c.-poled samples. These 
samples exhibited near-perfect light transmit- 
tance, large birefringence (an effect in which 
a material’s refractive indices are different 
along different axes) and ultrahigh piezoelec- 
tric coefficients, exceeding 2,100 pC N™. For 
comparison, ferroelectrics that have a similar 
level of transparency to these crystals, such 
as lithium niobate or polyvinylidine fluoride, 
typically have piezoelectric coefficients! of 
less than 40 pC N17. 

The report by Qiu et al. adds the optical 
component to high-performance piezo- 
electric crystals and therefore opens the door 


to the design of electro-optical-mechanical 
devices. Transparent actuators or motors 
could be used for touch screens in consumer 
electronics or for the development of invisible 
microrobots. Moreover, the crystals’ high 


“Transparent actuators or 
motors could be used for 
touch screens or for the 
development of invisible 
microrobots.” 


electromechanical performance and trans- 
parency could be harnessed in an imaging 
technology knownas photoacoustic imaging 
or in piezoelectric light guides. 

Although the authors demonstrated the 
applicability of their approach to several 
PMN-based crystals, it remains to be seen 


whether similar principles can be applied 
to other ferroelectric systems. Of particular 
interest are systems that remain polarized at 
high temperatures (above 100-150 °C), unlike 
PMN-based materials. Nevertheless, Qiu and 
colleagues’ discovery has arrived just in time 
to meet the growing demand for multitasking 
smart materials and hybrid devices. 
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Bacteria are under immense evolutionary pressure from their viral invaders— 
bacteriophages. Bacteria have evolved numerous immune mechanisms, both innate 


and adaptive, to cope with this pressure. The discovery and exploitation of CRISPR- 
Cas systems have stimulated a resurgence in the identification and characterization of 
anti-phage mechanisms. Bacteriophages use an extensive battery of counter-defence 
strategies to co-exist in the presence of these diverse phage defence mechanisms. 
Understanding the dynamics of the interactions between these microorganisms has 
implications for phage-based therapies, microbial ecology and evolution, and the 
development of new biotechnological tools. Here we review the spectrum of 
anti-phage systems and highlight their evasion by bacteriophages. 


Bacteriophages (phages) are viruses that infect bacteria and it has been 
estimated that there are 10” phages present in the biosphere’. Their 
abundance accounts for 20-40% of bacterial mortality daily”, and has 
a considerable impact on biogeochemical cycles. The pressure of 
phage infection on bacteria has resulted in the evolution of multiple 
bacterial immune systems, each of which hampers different stages 
of the phage life cycle*» (Fig. 1). Unsurprisingly, phages have evolved 
a myriad of ways to overcome these defences>*, which in combina- 
tion with phage diversity, has contributed to the diversity of bacterial 
immune mechanisms. 

Research interests in bacterial-phage interactions, and in particular 
bacterial defences, are manifold. First, although the importance of 
these interactions for global ecology is accepted, large sequencing 
efforts, such as the Jara Oceans project, are furthering our under- 
standing by showing that phages drive rapid evolution through the 
daily transfer of approximately 10” genes between bacteria’. Second, 
phage-based therapies are becoming feasible antibacterial treatments 
as alternatives to antibiotics, owing to the rise of antibiotic resistance’. 
For successful therapy, it is critical to understand how bacterial patho- 
gens might become resistant to phages and, therefore, recalcitrant 
to treatment. Finally, phage-resistant strains are required in different 
industries’ and fundamental research into phage-defence mechanisms 
has underpinned the development of these, and other, applications, 
such as gene editing and diagnostics”. The importance of bacterial 
immune systems to these areas has led to aresurgence in the discovery 
and characterization of phage-resistance mechanisms. Here we focus 
onthe diverse systems that bacteria use to resist phages and howtheir 
phage invaders can evade these immune mechanisms. 


Preventing adsorption 


Phages exploit at least three different lifestyles to reproduce. Viru- 
lent phages replicate exclusively through the lytic cycle, exploiting 
bacteria to make new phages before their release by cell lysis” (Fig. 1). 
Alternatively, in addition to the lytic cycle, temperate phages can enter 
the lysogenic cycle and form prophages that are integrated into the 


bacterial chromosome or maintained extrachromosomally! (Fig. 1). 
By contrast, filamentous phages cause chronic infections and are con- 
tinuously secreted from the bacterium without lysis”. For infection 
to occur, phages must adsorb to the cell surface by binding to phage 
receptors, and inject their genome (Fig. 2a). To prevent adsorption, 
bacteria can alter or disguise receptors through surface modification 
(Fig. 2a). For example, receptor mutations in ompU in Vibrio cholerae 
confer resistance to the vibriophage ICP2”. Bacteria can also use recep- 
tors as phage decoys. In this case, outer membrane vesicles (OMVs) 
that contain receptors bud off from Escherichia coli and Vibrio, and can 
bind to phages, reducing productive infections" (Fig. 2a). Nonethe- 
less, OMVs have complex effects on phage dynamics because they can 
also extend the host range of phages. Indeed, phage receptors were 
transferred by OMVs to Bacillus subtilis cells that previously lacked 
the receptor, rendering B. subtilis and other phage-resistant species 
susceptible to phages” (Fig. 2b). Although this provides only transient 
susceptibility, the receptors may subsequently facilitate the transfer of 
receptor genes through generalized transduction, which could lead to 
a permanent heritable change in phage susceptibility. Inhibiting DNA 
entry into the bacterial cell is another defence strategy. For example, 
the Imm and Sp proteins of phage T4 prevent the DNA of other T-even 
phages from being translocated across the membrane’*. However, 
systems that prevent DNA entry are typically encoded on prophages 
and inhibit infection by subsequent phages’. 

The fitness costs of receptor mutations have led to other 
strategies that impede attachment?. Phase variation enables the 
reversible expression of phage receptors, resulting in phage-resistant 
bacterial subpopulations*”. Furthermore, receptors can be masked, 
preventing recognition while retaining function. For example, capsules 
or exopolysaccharides provide phage resistance in Staphylococcus, 
Pseudomonas and other species'*”” (Fig. 2a). Subtle modifications 
can also disguise receptors from phages, suchas in Pseudomonas aer- 
uginosa, in which pilus and O-antigen modifications and type IV pili 
glycosylation occludes phages”>”!. 

Receptor modification can select for phages that recognize the 
mutated, or alternative, receptors. In coevolution studies, it was shown 
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Fig. 1|Anti-phage mechanisms act at different stages of the phage life cycle. 
Virulent phages replicate exclusively through the lytic cycle, whereas 
temperate phages may replicate through either the lytic or the lysogenic 

cycle. Bacteria have numerous anti-phage systems that function at different 
stages of the phage life cycle to prevent productive phage replication. Abortive 
infection mechanisms provide population protection and function at different 
stages of the phage life cycle, indicated by the blue dashed line. 


that the receptor-binding proteins (RBPs) of phages—which bind to the 
bacterial receptors—are often mutated (Fig. 2b). For example, phageA 
overcameaLamB receptor mutant of £. colithrough tail fibre mutations, 
some of which caused stochastic protein folding into different forms 
that enabled the recognition of a new receptor (OmpF) or mutated 
LamB””?, Access to anew receptor occurred through baseplate and 
tail fibre mutations in B. subtilis phage SPO1, which also resulted in an 
extended range of Bacillus hosts”. Instead of mutating, some phages— 
suchas the F. coli phage phi92—have more than one RBP, enabling the 
recognition of multiple receptors”. RBP variability can be further 
expanded by diversity-generating retroelements that result in hyper- 
variable RBPs. For example, the diversity-generating retroelement of 
the Bordetella phage BPP-1 creates variable tail fibres, changing the 
receptor tropism of the phage’. Finally, when receptors are masked 
by exopolysaccharide capsules, some phages produce depolymerases 
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that enable them to gain access” (Fig. 2b). Recent research on receptor 
interactions is driven by the goal to understand and manipulate phage- 
receptor interactions to extend the host range for biotechnological 
and medical applications””. 


Restriction-modification and related defences 


The biotechnological use of restriction—modification (RM) systems 
has led to these systems being the most well-characterized phage- 
resistance mechanism—they are highly diverse and ubiquitous, and 
are present in around 90% of bacterial genomes”®. These systems 
distinguish self from non-self DNA to recognize and destroy phage 
DNA after its injection. Discrimination is due to DNA modifications 
at specific sequences and is characteristic of anumber of anti-phage 
systems. Two components that are typically present in RM systems are 
a methyltransferase and a restriction endonuclease (Fig. 3a, b). Both 
recognize restriction-site sequences; the methyltransferase methyl- 
ates DNA and the restriction endonuclease cleaves the unmethylated 
sequence. A comprehensive review of RM systems has been published 
previously”’. A range of other phage-resistance systems have similarities 
to RM systems, but their functions appear to be more complex owing 
to the presence of additional genes. 

One RM-like phage-defence mechanism is the phage growth limi- 
tation (Pgl) system in Streptomyces coelicolor, which modifies and 
cleaves phage DNA**”, Pg! has three phases and requires four genes, 
pglW, psx, pglYand pglZ (Fig. 3a, c). Phages become methylated after 
infecting Pgl* bacteria and, following release, these phages can infect 
other cells. During subsequent infections, the modified phage DNA 
is cleaved. Hence, although the initial infected cell does not survive 
phage infection, it is able to mark the phage to ‘warn’ neighbouring 
cells*. Genes similar to pglZ from the Pgl system were identified in 
six-gene clusters, including brxABCL and pglX® (Fig. 3a). These were 
termed bacteriophage exclusion (BREX) genes and have been char- 
acterized in Gram-positive (B. subtilis”) and Gram-negative (E. coli*) 
bacteria. Akin to RM, BREX acts after DNA injection to prevent phage 
replication and lysogen formation, but differs as DNA cleavage was 
undetectable*. BREX further differs from Pg] in that it restricts phages 
upon first exposure”; however, the precise mechanism by which 
BREX prevents infection remains unresolved. 

Another RM-like system was recently identified—termed defence 
island associated with restriction-—modification (DISARM)* (Fig. 3a). 
Class 1 and 2 DISARM share three core genes, with each class having 
two distinct additional genes. Class 2 DISARM includes a five-cyto- 
sine DNA methyltransferase and a system from Bacillus paralicheni- 

formis prevented phage DNA accumulation by distinct families of 
double-stranded (ds)DNA phages. Notably, phages modified at a 
specific sequence, and therefore presumably masked from the RM- 
like system, were inhibited. Furthermore, DISARM offered protection 
against phages that lacked the sequence recognized by the RM-like 
system. These results indicate that the mechanism differs from classic 
RM systems. To add further mystery to the DISARM mechanism, the 
candidate nuclease was dispensable for resistance**. 

Phages have amassed strategies to counteract RM, and potentially 
RM-like, systems®*5"°, Phage DNA can become methylated by the host 
methyltransferase on entry, disguising the DNA from the host restric- 
tion endonuclease. The resulting phages become phenotypically 
RM- insensitive; however, this epigenetic avoidance is transient and 
is lost following infection of methyltransferase-deficient bacteria. In 
addition, RM sites are mutated, underrepresented or absent in phage 
genomes to prevent restriction®””’s (Fig. 3d). Phages also exploit modi- 
fied or unusual bases, such as hydroxymethylation, glycosylation, glu- 
cosylation and acetamidation to make these sites unrecognizable to the 
restriction endonuclease”. Specifically, coliphage 9¢ utilizes a deoxyar- 
chaeosine modification to avoid restriction”. Some phage proteins (for 
example, Ral from A and P22 phages) activate host methyltransferases 
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Fig. 2| Preventing phage adsorption. a, Bacteria have developed anumber of 
methods to prevent phage adsorption. These include altering (green), 
disguising (blue), modifying (red) or masking (blue circles) receptors and the 
use of decoy OMVs. b, Phages can co-evolve to recognize the modified 


and promote DNA modification to protect against restriction endonu- 
cleases*®*!, Phages can also encode methyltransferases, which protect 
their DNA from restriction” (Fig. 3d), such as the methyltransferase of 
the Bacillus phage SPR, whichcan modify three sites to protect against 
multiple nucleases****. Phages can also prevent degradation of their 
genomes using the defence against restriction (Dar) system. The Dar 
system of coliphage P1 limits DNA degradation by type | restriction 
endonucleases*“*. Dar proteins are injected along with the phage 
DNA and function in cis. Another successful anti-RM strategy is the 
direct inactivation of restriction endonucleases. The overcome classical 
restriction (Ocr) protein of coliphage T7 is expressed immediately after 
DNA injection, mimics DNA, and tightly binds and sequesters the EcoKI 
restriction endonuclease*”**. Routes of phage escape from the recently 
discovered, RM-like systems have yet to be thoroughly investigated. 
However, phages are likely to use similar anti-restriction mechanisms 
for DISARM and BREX. No phages that have escaped Pgl systems have 
been isolated’, suggesting that bacterial protection by this system 
may be more robust than other RM-like systems. 


CRISPR-Cas adaptive immunity 


The ability to cleave phage DNA ina sequence-specific manner is shared 
by both RM and clustered regularly interspaced short palindromic 
repeats (CRISPR)—CRISPR-associated protein (Cas) systems. However, 
CRISPR-Cas provides ‘adaptive’ immunity through the generation 
of memories of past phage encounters that guide sequence-specific 
immunity. CRISPR-Cas immunity is present in about half of sequenced 
bacteria and is mediated by three stages*' **: adaptation, expression 
and interference (Fig. 4a). The mechanistic diversity of CRISPR-Cas 
systems is considerable—currently there are two classes, six types and 
more than 30 subtypes**>>. 
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receptor, through mutations, and produce extracellular-matrix-degrading 
enzymes. OMVs canalso extend the host range of phages, by transferring 
receptors used by the phage to cells that previously lacked those specific 
phage receptors. 


Class 1 systems include types I, II and IV, which have multi-subunit 
Cas complexes. Various typeI CRISPR-Cas subtypes have been shown 
to provide phage resistance** , whereas type IV systems—which are 
most-closely related to type I—are poorly characterized and their role 
in phage resistance is unknown®. Type III systems differ from other 
class 1 systems, because they target both RNA and DNA®*. Resist- 
ance to lytic infection has been demonstrated by the type Ill systems of 
Staphylococcus epidermidis® **, Lactococcus lactis® and Streptococcus 
thermophilus’°; however, the RNA-dependent targeting provides toler- 
ance to prophages”. An interesting feature of type II systems is that 
Cas10 synthesizes intracellular signals (cyclic oligoadenylates) that 
bind an accessory RNase and unleash its promiscuous activity’”?. 
The RNase may have an abortive infection effect (see ‘Protecting the 
bacterial population’ section), adding a further layer of defence by 
inducing dormancy through unspecific cleavage of both host and phage 
transcripts”. 

Class 2 CRISPR-Cas includes type II, V and VI systems, which are 
characterized by single-subunit effectors. The first direct evidence 
that CRISPR-Cas provides immunity against phages was provided by 
the type II-A system of S. thermophilus” and was later shown in Strepto- 
coccus pyogenes’®. Type Il systems use Cas9 to generate dsDNA breaks, 
whereas type V systems use Cas12”. Although there are few studies 
that have investigated phage resistance by type Vsystems, it has been 
shown that the Francisella novicida system protects against phage 
infection in £. coli’®. The dsDNA breaks induced by class 2 systems have 
been exploited in biotechnology, but may be less effective for clearing 
phages. In support of this idea, class 2 systems are less common than 
typeI, which havea potentially more destructive DNA-shredding mech- 
anism». Finally, class 2 systems can recognize and cleave phage RNA. 
Indeed, Cas13 from the type VI system of Leptotrichia shahii cleaved 
phage MS2 RNA in E. coli”. Upon target recognition, Cas13 not only 
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Fig.3|RM-like systems. a, Many proteins and protein domains are shared 
between the RM, Pgl, BREX and DISARM systems. The blue genes indicate 
enzymes that are responsible for DNA modification (methyltransferases (MT)), 
the purple gene (pg/Z) encodes a conserved protein (an alkaline phosphatase) 
and orange genes inthe DISARM system indicate core genes. RE, restriction 
endonuclease. pg/W encodes a serine/threonine kinase; pg/X encodes an 
adenine-specific methyltransferase; pg/Y encodes an ATP-binding protein; 
brxA encodes an RNA-binding anti-terminase; brxB encodes a protein withan 
unknown function; brxC encodes an ATP-binding protein; brxL encodesa 
protease; drmA encodes a putative helicase; drmB encodes ahelicase- 
associated protein; drmC encodes phospholipase D/nuclease; drmD encodes 
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cuts complementary transcripts, but also becomes a promiscuous 
RNase””®°. This promiscuous RNase activity can cleave phage MRNAS 
and host RNAs, inducing dormancy and providing Cas13-mediated 
resistance against dsDNA phages®° 

The sequence specificity of CRISPR-Cas selects for phages with 
mutations in targeted regions (Fig. 4b). Indeed, mutations in proto- 
spacer-adjacent motifs and spacer targets (i.e. protospacers) enable 
phages to overcome type! systems”°? *!*! and type Il systems® ®. Inser- 
tions, deletions and recombination events can also mediate phage 
escape”*”®!§+85, However, type I systems have a positive feedback 
mechanism to restore or enhance immunity by acquiring multiple 
new spacers that target escape phages—a process called priming”. 
There is now bioinformatic and experimental evidence that priming 
occurs in type Il systems®**”, Nevertheless, phages can evade primed 
strains with multiple spacers by deleting the target region™. 

As type V and VI systems can also degrade non-specific single- 
stranded (ss)DNA (type V) or RNA (type VI), they might provide an 
additional layer of resistance, which may explain why escape phages are 
yet to be identified for these systems”. In agreement with this notion, 
dormancy induced by type VI systems suppressed the emergence 
of escape mutants and protected the bacterial population against 
phages®. Similar to RM system evasion, phages can modify DNA to 
reduce Cas complex binding and cleavage—as seen for T4 evasion of 
type I-E and I-A systems”, 

Escape mutations can lead to phage fitness defects, and if essential 
genes are targeted, escape might be impossible. As an alternative, some 
phages have anti-CRISPR (Acr) proteins that inactivate CRISPR-Cas 
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Methylated phage DNA 


Remove restriction sites in phage DNA 
= 


Encoding a MT to methylate phage DNA 


an SFN2 helicase; and drmM1 encodes an N°-adenine DNA methyltransferase. 
b, (1) RM restricts any phage DNA that is not modified by methylation; (2) 
however, modified phages (green phage; see d) canreplicate on RM‘ strains. 
(3) Modified or (4) unmodified phages canreplicateonanRM strain but will 
lose any modifications. c, Pgl systems only restrict phages that have been 
previously exposed to the system. (1) A naive phage can replicate on Pgl*, (2) but 
upon secondary infection of a Pgl’ strain, the phage (shown in green) is 
restricted. (3) Modified (yellow) or (4) unmodified (grey) phages can replicate 
onPgl strains. d, Mechanisms of phages for avoiding RM and RM-like systems 
include methylation of DNA, removing recognition sequences from their 
genome and encoding a methyltransferase to methylate the phage DNA. 


systems®**? (Fig. 4b). Acrs have been identified for type, II, III, Vand VI 
systems and most interact with the Cas proteins to block activity?” 
Recently, an Acr has been shown to acetylate a type V system to prevent 
DNA binding”, and another inactivates Cas12 by triggering cleavage 
of CRISPR RNA bound to Cas12”. Notably, some phages must cooper- 
ate to exploit their Acrs. Acrs produced by the first phage that infects 
can immunosuppress the host, but may fail to fully protect the phage 
from CRISPR-Cas, while enabling a productive infection by succes- 
sive phages”. It is possible that Acrs have provided a selection for 
CRISPR-Cas diversity, but the ecological importance of their mecha- 
nistic diversity is unclear (see Box 1). 

Phage defences such as CRISPR-Cas are sometimes encoded by 
phages”*””. For example, CRISPR arrays occur in prophages of Clostrid- 
ium difficile, which target other C. difficile phages, and CRISPR-Cas 
systems are also present in ‘huge’ phages”*’”°. In many phages, these 
systems are incomplete—lacking genes for adaptation or interfer- 
ence””’", Phages that contain these incomplete systems have been 
proposed to co-opt the required proteins from the host, or repress 
transcription without cleavage, akin to RNA interference’. These 
phage-encoded CRISPR-Cas components may also eliminate com- 
peting phages and manipulate the hosts'™. Indeed, acomplete system 
expressed by a Vibrio phage can protect against a host defence island”. 
Phages can also transduce CRISPR-Cas systems between bacteria, 
which can provideimmunity against other phages”. These examples 
highlight how some phages have manipulated CRISPR-Cas systems 
as away to avoid defence systems in the host and endow them withan 
advantage over competing phages. 


pre-crRNA 


Anti-CRISPRs 


Fig. 4| CRISPR-Cas adaptive immunity and how phages overcome the 
CRISPR-Cas adaptive immune system. a, Schematic of the three stages of 
CRISPR-Cas immunity, including adaptation (stage 1), expression and 
maturation (stage 2), and interference (stage 3). crRNA, CRISPR RNA. b, Phages 
have the ability to overcome CRISPR-Cas defences through point mutationsin 


Protecting the bacterial population 


In contrast to RM and adsorption inhibition, which confer individual 
benefits, abortive infection (Abi) anti-phage systems protect the bac- 
terial population®. Abi is characterized by successful phage entry; 
however, development is interrupted, resulting in the release of few, 
ifany, phages and the host cell dies, which prevents a phage epidemic 
and protects the bacterial population’™. ‘Altruistic’ Abi systems are 
widespread in Gram-positive and Gram-negative bacteria’; however, 
as Abi systems are defined by phenotype, rather than genotype, their 
discovery has been sporadic'”’. Nevertheless, the presence of many 
Abi systems on plasmids has been used successfully to identify these 
systems, particularly inlactococci’®’. The mechanistic details of phage 
abortion are unknown for many systems, although disruption of essen- 
tial processes, such as replication, transcription, translation and DNA 
packaging is common*™, 

An Abi mechanism in S. epidermidis was recently shown to involve 
a serine/threonine kinase (Stk)"®®. Activated Stk phosphorylates pro- 
teins involved in translation, transcription, cell cycle control, the stress 
response, central metabolism, and DNA topology and repair’™. Death 


Bacterial cell 


DNA eae 
eo 
Mature crRNAs 


Nucleus-like 
structures 


the protospacer-adjacent motif (PAM) or protospacer, deletions or 
modifications of the DNA so that the DNA cannot be bound by Cas complexes. 
Phages can also encode protein anti-CRISPRs that can interfere with CRISPR 
immunity, and jumbo phages produce a nucleus-like structure that excludes 
Cas complexes, thus preventing DNA targeting. 


of infected bacteria occurs through this phosphorylation pathway, 
decreasing phage release and protecting the population’. The pres- 
ence of serine/threonine kinases in eukaryotic viral defences suggests 
there are shared immune strategies between these kingdoms. Kinases 
also play wider roles in viral defence in bacteria, with examples in the 
BREX and Pgl systems*°?”. 

The phenotypic definition of Abi systems is also reflected in their 
mechanistic diversity. For example, F. colilambda lysogens encode 
RexAB, which is activated by a poorly-characterized T4 phage protein- 
DNA complex’**”°°. When triggered, RexA activates RexB, which formsa 
membrane channel that leads to ATP leakage, lost membrane potential 
and phage exclusion’. RexAB-like systems are widespread, with their 
recent identification in actinobacteriophages. For example, rexAB-like 
genes in Mycobacterium smegmatis and Gordonia terrae prophages 
abort multiple phages”*"”. In each host, phage escape mutants were 
identified and all contained mutations in the proteins that triggered 
RexAB activity”®. 

Asubset of Abi systems function through a toxin-antitoxin mecha- 
nism. Toxin-antitoxin systems are composed ofa toxin and an antitoxin 


Nature | Vol577 | 16 January 2020 | 331 


Review 


Box 1 


Ecology, evolution and phage 
defence systems 


Interactions between bacteria and phages can have important 
consequences for microbial communities and it is essential 

to study these in more natural contexts”"*. Cocultures of 
bacteria and phages, which enable the assessment of changes 

in phage resistance and susceptibility, can provide insights into 
coevolutionary dynamics. For example, in early experiments with 
E. coli and T-even phages, bacteria became resistant through 
surface modification’®"", whereas experiments with Pseudomonas 
fluorescens SBW25 and phage 2 showed coevolution of these 
microorganisms over time'®?"**. When monitored in soil, the 
coevolution of P. fluorescens and ©2 still occurred, but in a 
different manner'®. The differences that were observed were due 
to the reduced nutrients, which increased the growth-rate costs 
of phage resistance. This example highlights the importance 

of considering the ecological context and communities when 
studying phage resistance. 

Evolutionary studies also provide insights into conditions in 
which bacteria might favour different resistance mechanisms. 

For example, P. aeruginosa used CRISPR-Cas when grown under 
nutrient-limiting conditions, whereas excess nutrients typically 
led to surface mutations. Both resistance outcomes were costly 
to bacteria, yet each mechanism was favoured under different 
ecological conditions”. 

Although laboratory experiments that mimic natural ecological 
and evolutionary scenarios benefit from being easy to manipulate, 
they cannot replicate environmental complexities. Metagenomics 
provides one way to complement the laboratory approach and has 
enabled the high-resolution examination of bacterial and phage 
communities from complex environments’. This technique has 
been useful for following the evolution of CRISPR-Cas resistance 
and subsequent phage escape in diverse environments, including 
acid mine drainage, the human gut, hyper saline lakes and a 
fish farm'*"’, Phages and bacteria can be monitored over time, 
providing valuable insights into the role of CRISPR-Cas immunity 
in shaping microbial communities. Furthermore, metagenomics 
has been key for identifying new anti-phage systems”’ and 
CRISPR-Cas variants’”0". 

Finally, mathematical modelling of bacteria-phage interactions 
provides insights into coevolutionary dynamics, helps to 
explain experimental observations and predict the influence of 
other ecological variables that can be difficult to manipulate 
experimentally'"”""”5, To gain a more complete understanding of 
phage-bacteria interactions and phage resistance, we must use a 
multidisciplinary approach by combining these complementary 
research areas with molecular studies. 


that are usually co-transcribed. The toxins targets essential cellular 
processes, leading to bacterial dormancy or death. There are six types 
of toxin-antitoxin systems, based on the identity of the gene prod- 
ucts (RNA or protein) and whether, and how, the toxin and antitoxin 
interact”. ToxIN, a type Ill system from Pectobacterium atrosepticum 
was the first example of an Abi system that was shown to function asa 
toxin-antitoxin mechanism™ and this has now been observed for other 
Abisystems'®"’, Different toxin-antitoxin types can elicit phage resist- 
ance, but have not beenstrictly classified as Abi systems, as the outcome 
for the infected bacterium was not defined. Examples in £. coliinclude 
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hok/sok (type) and rnlA/rnlB (type Il), which exclude phage T4"°™, and 
mazEF (type Il), which excludes phage P1””. Many of these toxins are 
RNases, a characteristic shared by several Abi systems. For example, 
E.coliPrrC isan RNase that cleaves lysine transfer RNA (tRNA”’) during 
infection, and only T4 phages that are able to repair this cleavage can 
replicate*. Thus, mutant T4 phages that lack a polynucleotide kinase 
or RNA ligase are aborted due to tRNA" cleavage”. 

To bypass toxin—antitoxin systems, phages can encode antitox- 
ins. For example, T4 produces Dmd, an antitoxin that inhibits F. coli 
RnlA and LsoA toxins”. Dmd differs from the RnIB or LsoB antitoxins, 
suggesting it evolved independently, which is highlighted by its differ- 
ent toxin neutralization mechanism™. Phages can generate diversity 
for escape by acquiring host genetic material through recombination. 
Indeed, recombination between lytic phages and resident lactococcal 
prophages led to Abi escape through gene loss or gain’. Recombina- 
tioncan also promote antitoxin acquisition by phages. For example, to 
escape ToxIN, phages containing a short tox/-like sequence recombined 
with tox/Nand directly gained tox/’. Notably, in other escape phages, 
tox!-like sequence duplications produced pseudo-ToxI RNAs that 
inhibited ToxN™®. Rather than encoding its own antitoxin, coliphage 
T7 evades atoxin-antitoxin system by producing a protein that has been 
proposed to prevent antitoxin degradation by the Lon protease. This 
ensures that the toxin remains inactive by increasing the stability of the 
host antitoxin’”. Finally, the T4 protein Alt (an ADP-ribosyltransferase) 
is injected with phage DNA, which chemically modifies the MazF toxin™®. 
ADP-ribosylated MazF has reduced cleavage activity, enabling the 
survival of the phage”®. 

Many new Abi systems await discovery and, indeed, new systems 
in different strains are still being uncovered. For example, Abia was 
recently identified in Enterococcus faecalis and leads to asynchronous 
lysis"’. To understand Abi responses, the phage genes involved can 
be revealed by isolating escape mutants. For example, ToxIN can be 
overcome by specific mutations in @M1and T4-like phage proteins”°!, 
However, the often toxic and poorly characterized nature of the phage 
Abi-triggering proteins is a frequent challenge for mechanistic studies. 


Prophage-encoded defence systems 


Prophages can have immune systems that prevent subsequent phage 
infection of lysogens (for example, rexAB). These non-essential tran- 
scribed regions or genes within prophage genomes have been called 
‘morons’ and canencode factors that benefit the host, suchas defence 
systems’. For example, morons (or ‘immunity cassettes’) within 
M. smegmatis prophages provide phage defence by encoding RM and 
toxin-antitoxin components, and other defence systems’. These 
systems can be remarkably specific; for example, prophage Charlie 
encodes a defence system that offered protection against only one 
phage of many tested. A different M. smegmatis prophage encoded a 
(p)ppGpp synthetase similar to RelA/SpoT that is proposed to be inac- 
tivated by a prophage ‘regulator’ protein. Lytic phage replication leads 
to rapid dissociation of the synthetase from the regulator, (p)ppGpp 
accumulation, growth cessation and stalled phage development’. 
Another phage, Fruitloop, encodes an immunity protein that interacts 
with Wag31, a cell-wall synthesis protein in M. smegmatis. Fruitloop 
inhibits superinfection by other phages that are thought to require 
Wag3] for DNA injection. Prophage-mediated phage defences are 
widespread. Indeed, a systematic study revealed that Pseudomonas 
lysogens have diverse prophage-encoded defences”. Furthermore, 
filamentous phages of the /noviridae family that cause chronic infec- 
tions were recently shown to have multiple toxin—antitoxin systems and 
superinfection systems”. As these systems are encoded by the phage, 
phage escape represents phage-phage coevolution. Accordingly, many 
genes of unknown function in prophages, especially within morons, 
may protect from superinfection and this knowledge may accelerate 
the identification of candidate resistance systems. 


Anewworld of diverse resistance systems 


Defence systems are often clustered in defence islands in bacterial 
genomes and unknown genes within these regions have been pro- 
posed to encode anti-phage systems™””®. This was supported by the 
discovery of BREX, DISARM and the Stk2 kinase*!**3*, and was the 
premise for asearch that uncovered 26 broadly distributed candidate 
defence systems’. Nine systems have been validated as anti-phage 
systems, some of which protect against specific phages, whereas others 
provided broader defence. Although the mechanisms are undeter- 
mined, multiple protein domains have been identified that are typical 
for phage-defence systems (for example, helicases and nucleases), 
in addition to proteins that have been proposed to be repurposed for 
phage defence. For example, components of the Zorya system, which 
is proposed to be an Abi system, show homology to the MotAB proteins 
that form the stator of the flagella complex, and are hypothesized to 
form amembrane channel that results in depolarization and cell death 
upon phage infection’. 

Prokaryotic Argonaute (Ago) proteins are also found in defence 
islands nearby other newly discovered and validated systems (for 
example, Thoeris), suggesting that they may also elicit phage 
defence”””®, Moreover, the eukaryote Ago proteins are key proteins 
in RNA-interference systems, and prokaryotic Ago proteins function as 
nucleic-acid-guided nucleases”’. Generally, prokaryotic Ago proteins 
generate and associate with short-interfering DNA or RNA guides. The 
single-stranded guides facilitate the identification of the complemen- 
tary sequence by prokaryotic Ago, which cleaves the target strand or 
produces double-stranded breaks®° *?, Following the discovery of 
prokaryotic Ago proteins, further parallels are being drawn with the 
eukaryotic immune systems—for example, with the eukaryotic cGAS- 
STING pathway that senses viral DNA and activates an innate immune 
response. Recently, prokaryotic cGAS homologues, which cluster near 
defence islands, have been identified"**. These cGAS-encoding genes 
reside in operons that include a phospholipase and two other genes 
that contain eukaryotic-like domains that are required for defence 
against some phages, but are dispensable for the defence against oth- 
ers. This pathway was named CBASS (cyclic-oligonucleotide-based 
anti-phage signalling system) and is triggered by an unidentified signal 
that causes cGAS to produce cyclic GMP-AMP (cGAMP). cGAMP acti- 
vates the phospholipase, which aborts a range of dsDNA phages by 
eliciting membrane damage and cell death‘. A second example is the 
eukaryotic-like HORMA proteins that are present in various bacteria, 
including E. coli’. These proteins sense unknown phage product(s) 
and, once activated, the HORMA domain activates a cGAS/DncV-like 
nucleotidyltransferase that produces the second messenger cyclic 
tri-AMP. Cyclic tri-AMP causes dsDNA cleavage by activating an endo- 
nuclease, whichin £. coliconfers A immunity”. It is currently unknown 
whether this results in abortive infection or targeted destruction of 
the phage’. The discovery of eukaryotic-like defences in prokary- 
otes suggest that systematic searches for homologues in bacteria may 
uncover many new anti-phage systems. 

Recently, a new type of phage defence was discovered that relies 
on small molecules rather than proteins”. This chemical defence is 
widespread in Streptomyces, a genus known for the prolific production 
of bioactive secondary metabolites. The metabolites block genome 
propagation by intercalating dsDNA. Because many secondary metabo- 
lites can diffuse and thus function outside of the cell, this has been 
proposed as an innate defence that protects bacteria before phage 
infection’. However, various aspects of the chemical defence strategy 
remain unclear, such as how the phage DNA is recognized as non-self. 

With such a diversity of defence systems, the arms race has esca- 
lated. Indeed, jumbo phages produce nucleus-like structures inside 
the infected bacterium, in which phage DNA replication and transcrip- 
tion occur®*”. In P. aeruginosa, this nucleus-like structure protects 
(KZ from type I-C, Il-A and V-A CRISPR-Cas and a typeI RM system””. 


Moreover, in Serratia, a distinct nucleus-forming jumbo phage evades 
the native DNA-targeting type I-F and I-E CRISPR-Cas systems™°. 
However, phage mRNA translated in the cytoplasm is susceptible 
to RNA-targeting by Cas13”” or type III-A defence in P. aeruginosa 
and Serratia, respectively. Therefore, this physical occlusion of the 
phage genome appears to be a widespread method to overcome anti- 
phage systems and this is supported by a paucity of typeI spacers that 
target jumbo phages in nature, whereas type III-A spacers are over- 
represented*°, 

Finally, extracellular chemicals not only engage in direct resistance 
against phages (for example, chemical defence””’), but also facilitate 
communication to pre-empt bacteria to increase their immunity. 
Indeed, quorum sensing—cell-density-dependent signalling—upreg- 
ulates bacterially encoded CRISPR-Cas and downregulates surface 
receptors when populations would otherwise be at increased risk of 
a phage epidemic “*. Perhaps unsurprisingly, phages also use com- 
munication to ensure productive infection“*”. These peptide com- 
munication systems (which are also knownas arbitrium) are diverse and 
widespread, and inform phages about host availability. Arbitrium has 
been proposed to limit phage-induced host decimation by determining 
whether phages enter the lytic or lysogenic life cycle* “°, Phages also 
encode LuxR-type proteins, which respond to Gram-negative quorum- 
sensing signals“*5° and quorum-sensing genes are also present in 
Gram-positive phages™. Although the function of phage quorum- 
sensing genes remains to be elucidated, they might allow phages to 
sense host density’. These examples of communication between 
phages and bacteria raise the question whether bacteria and phages 
engage in ‘espionage’, where either party listens in to, or interferes with, 
the communications of the other to manipulate the outcome for their 
own benefit. However, the roles and implications of phage-phage and 
phage-bacteria communications remain to be understood. 


Perspectives 


There is a clear diversity of phage-resistance mechanisms and ways that 
phages evade these systems. This knowledge is informing microbiology, 
the potential of phage-inspired therapeutics and new biotechnological 
tools. Despite considerable advances, we are far from understanding 
bacterial defences and phage counter-adaptation across scales—from 
molecules, single cells, communities, ecosystems and through to the 
global scale (Box 1). Furthermore, the recent discovery of completely 
new systems demonstrates that our view of the defence arsenal is 
incomplete, and that their identification requires more systematic 
approaches. Increased sequencing data will expand the success of 
bioinformatics strategies, but these need to be complemented by high- 
throughput experimental techniques. For example, phage-based posi- 
tive selection of new anti-phage systems from metagenomic libraries 
could be exploitedina similar manner to those reported for anti-CRISPR 
discovery’. To advance the field, both sides of this arms race, the bac- 
teria and the phages, must be considered. 

Interms of bacterial defences, critical gaps exist in our understand- 
ing of molecular mechanisms—for both old and new systems—and 
new techniques should be applied to uncover their mode of action. 
Determining the molecular mechanisms of diverse defences will 
undoubtedly lead to both fundamental biological knowledge and new 
technologies—as exemplified by the exploitation of CRISPR-Cas and 
RM systems. Furthermore, most defence systems have been studied 
without considering other co-existing immune mechanisms. Indeed, 
bacteria often have multiple CRISPR-Cas systems, in addition to other 
innate defences. How these function together—whether redundantly 
or synergistically—is not well understood, but they may help bacteria 
to resist diverse phages and overcome escape phages’. In fact, RM and 
CRISPR-Cas act together to increase phage resistance, and crosstalk 
between CRISPR-Cas systems can provide protection against escape 
phages'*5, In addition, each defence system is likely to have different 
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costs and benefits that depend on the niche inhabited and these factors 
may be key drivers in the evolutionary selection of defences. 

Our understanding of phages is improving, in part due to the 
increased availability of sequencing data, but given their global abun- 
dance, we only have atiny snapshot of this ever-changing community. 
Poor functional gene annotations highlight the gaps in fundamental 
phage biology and hinder our ability to understand their interactions 
with bacterial immune systems. We can focus on genes that prob- 
ably influence bacterial immunity. For example, prophage-encoded 
defences and anti-defences are commonly found in particular genomic 
locations and their discovery has been facilitated by comparative 
genomics of phage families. Moreover, early expressed genes often 
have important roles in anti-defence or bacterial takeover’*; however, 
studying these genes has been hampered by the paucity of genetic tools 
for phages. Reassuringly, phages are becoming genetically tractable 
due to CRISPR-Cas methods. To realize the ecological importance, 
and the therapeutic and biotechnological implications of bacterial 
immune systems, mechanistic studies must be complemented with 
evolutionary and ecological experiments to illuminate how molecular 
events scale to global microbial processes. 
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The central 0.1 parsecs of the Milky Way host a supermassive black hole identified 
with the position of the radio and infrared source Sagittarius A* (refs. 1), a cluster of 
young, massive stars (the S stars’) and various gaseous features**. Recently, two 
unusual objects have been found to be closely orbiting Sagittarius A*: the so-called 
Gsources, Gland G2. These objects are unresolved (having a size of the order of 

100 astronomical units, except at periapse, where the tidal interaction with the black 
hole stretches them along the orbit) and they show both thermal dust emission and 
line emission from ionized gas° °. Gl and G2 have generated attention because they 
appear to be tidally interacting with the supermassive Galactic black hole, possibly 
enhancing its accretion activity. No broad consensus has yet been reached concerning 
their nature: the G objects show the characteristics of gas and dust clouds but display 
the dynamical properties of stellar-mass objects. Here we report observations of four 
additional G objects, all lying within 0.04 parsecs of the black hole and forming a class 
that is probably unique to this environment. The widely varying orbits derived for the 


six G objects demonstrate that they were commonly but separately formed. 


We used near-infrared (NIR) spectro-imaging data obtained over the 
past 13 years” at the W. M. Keck Observatory with the OSIRIS integral 
field spectrometer”, coupled with laser guide star adaptive optics 
wave front corrections”. OSIRIS data-cubes have two spatial dimen- 
sions—about 3 arcsec x 2 arcsec surrounding Sgr A* with a plate- 
scale of 35 mas—and one wavelength dimension that covers the Kn3 
band, 2.121-2.229 um, with a spectral resolution of R ~ 3,800. We 
selected 24 data-cubes based on image quality and signal-to-noise 
ratio; see Methods section ‘Observations’. These cubes were processed 
through the OSIRIS pipeline”. We also removed the stellar continua 
to isolate emission features associated with interstellar gas (Meth- 
ods section ‘Continuum subtraction’). The reduced data-cubes were 
analysed with a three-dimensional visualization tool, OsrsVol*, that 
simultaneously displays all dimensions of the data-cube. This helps 
disentangle the many features of this crowded region, which are 
often superimposed in the spatial dimension but are separable in the 
wavelength dimension (Fig. 1). 

Analysing the data with OsrsVol as well as conventional two- 
dimensional (2D) and one-dimensional (1D) tools, we identify four 
new compact objects in Brackett-y line emission (Bry; 2.1661 pm rest 
wavelength) that consistently appear in the data across the observed 
timeline. In addition to Bry, all four objects show two [Fe III] emission 
lines (at 2.1457 um and 2.2184 um; ref. !°). 

The four objects show many properties in common with G1 and G2 
(compact Bry emission and coherent orbital motion) and we therefore 
name them G3, G4, G5 and G6. G3 was previously identified (D2””). 
For this work we independently identified G3 in Bry emission, and G4, 
G5 and Gé are newly reported. Recently, G6 has been independently 


examined’’, and interpreted as a bow shock source rather than a 
G object. We estimate that we are able to detect G objects having Bry 
flux densities of at least 0.02 mJy, if they lie in a non-confused location. 

Several other infrared-excess sources have been identified with L’ 
and K’ observations (central wavelengths of 2.2 1m and 3.8 pm, respec- 
tively”””, see Extended Data Fig. 1). We donot include these other sources 
in this work (except for G3/D2), either because they lie outside the 
OSIRIS field of view, or because they have not been detected in Bry, 
or because they have not been consistently detected throughout the 
13 years of data. We use Keck/NIRC2 L’ imaging data to investigate 
whether G3, G4, GS and G6 have detectable L’ counterparts, as Gland 
G2 do (Methods section ‘L’ detection analysis’). No L’ counterpart was 
detected for G4, G5 and G6, with upper limits to the flux density of 
0.4 mJy, 0.6 mJy and 0.5 mJy, respectively. G3 is detected in L’ witha 
dereddened flux density of 2.5 mJy, consistent with a previous report”. 

None of the G objects was detected in the K continuum. Our 
detection limit in the K continuum is 0.01 mJy in the OSIRIS spectra 
(Kn3 filter) and inthe K’ broadband (2.12 um central wavelength) a limit 
of 0.07 mJy was reported for G2” (but see ref. ”°). 

The Bry emission is a key defining feature of the G objects because 
it probably results from external ionization and does not depend on 
the mass of a putative central object, and hence its presence is inde- 
pendent of the nature of the G objects (low-mass cloud or extended 
stellar-mass object). The compactness of such emission is what distin- 
guishes the G objects from other presumably short-lived gas blobs that 
have become detached from larger-scale interstellar structures. The 
dust heating can be attributed to some combination of the external 
radiation field and an internal stellar core, if present. Therefore, the 
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Fig. 1/2006 OSIRIS data-cube visualized with OsrsVol. The spatial 
dimensions (x-) cover the OSIRIS field of view. The wavelength dimension is 
centred around Bry (+1,500 kms“). G3, G4 and Gé are blueshifted, whereas G5 
is redshifted. Gland G2 are not visible here because they have larger velocities. 
The extended emission in the middle is near the rest wavelength and it arises 
from foreground or background gas (‘superimposed extended emission’). 

The emission extending the full length of the wavelength axis at a few positions 
(‘stellar continuum residuals’) is associated with continuum subtraction 
residuals. For this analysis, we only use sources that appear throughout the 
observed timeline. 


lack of detection of the G objects in the L’ band does not necessarily 
have implications for the existence of astellar object embedded within 
the ionized external envelope. 

The proper motions of the G objects were determined from the Bry 
centroid inthe OSIRIS data (Methods sections ‘Aligning OSIRIS epochs’ 
and ‘Astrometric measurements and uncertainty’). We furthermore 
determined the radial velocity of each object by extracting its spec- 
trum over a1.5-pixel-radius aperture on each data-cube and perform- 
ing a Gaussian fit to the Bry profile (Methods section ‘Radial velocity 
measurements and uncertainty’). 


All G objects show large proper motions and have substantial radial 
velocity shifts; the radial velocity of G3 changed by about 300 kms 
over 13 years (see Fig. 2b and Methods section ‘Radial velocity measure- 
ments and uncertainty’). 

Using these measurements (Extended Data Table 2), we determined 
the orbits of the new G objects with a Keplerian model, using a fitting 
algorithm” (Methods section ‘Orbit fitting’) with six orbital parameters, 
two parameters accounting for systematic errors in both astrometric 
positions and radial velocities, and one parameter accounting for corre- 
lation within the astrometric measurements. The black hole parameters 
(mass and Galactic Centre distance) are considered fixed". The best-fit 
orbits are illustrated in Fig. 3 and the orbital parameters are reported in 
Extended Data Table 3. These fits indicate that: (1) G3, G4 and G6 have 
orbits with modest eccentricities (e ~ 0.15, 0.3 and 0.3, respectively), 
whereas GS has a very eccentric orbit (e = 0.9); (2) the orbital periods 
range between 170 years (for G3) and 1,600 years (for G5); (3) all orbits 
lie on different planes, none of which contains Gl and G2 orbits or the 
clockwise stellar disk”; and (4) the orbits all have periods much longer 
than the 13 years of observations, which implies a small orbital phase 
coverage (~9% and ~2% in true anomaly for G3 and GS, respectively). 
We have run coverage tests to assess the bias attributable to the low 
phase coverage (Methods section ‘Dependence on priors’) and the 
results show that the obtained orbital parameters are not significantly 
biased (consistent with an unbiased result to within lo). 

We used a Gaussian fit to the Bry and the brightest [Fe 111] line 
(2.2184 um) profiles to extract fluxes. There is no noticeable flux vari- 
ation for any of the four newly reported G objects in the 13 years of 
observations (Methods sections ‘Flux calibration’, ‘Flux measurements’ 
and ‘Flux and FWHM summary table’). Nor can we detect any variation 
inthe line width, given the variations in the data quality, instrumental 
upgrades and the emission line blending with other features. 

Our analysis shows that the new objects show many of the same char- 
acteristics as G1 and G2, enough to justify defining them as members 
of acommon new class. We define the G objects to have the following 
characteristics: (1) presence of a distinct source of Bry emission; (2) 
spatially compact emission; (3) relatively weak K-band continuum 
emission (such that kK’ - L’> 4.5); and (4) large proper motion and radial 
velocity shifts over time. By ‘compact’ we mean that they are unresolved 
(<0.03”) or slightly resolved (-0.05”). 
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Fig. 2| Proper motion and spectrum of the G objects. a, Observed proper 
motions (with error bars showing standard deviations) of G objects and SO-2 on 
the plane of thesky. RA, right ascension; dec., declination. b, G3 spectrum 
(black) and Gaussian fit to the G3 Bry emission (red) in each year. There is no 
detected variation in the line width, but the G3 emission line blends with 
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neighbouring features as it changes radial velocity. The large changes inthe 
radial velocity of G3 contrast with the static extended foreground (or 
background) emission at the rest velocity (‘superimposed extended 
emission’). G objects have Bry emission, large proper motion and radial 
velocity shift, and are not detected inthe K continuum. 
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Fig.3| Orbits of the G objects. a, Orbit models in three dimensions: the portion 
of the orbit behind the plane of the sky containing Sgr A* is represented asa 
dashed line, and the part between the observer and Sgr A* is represented asa 
solid line. The thick short line indicates the time span of the observations 
(2006-18) and it gets darker and thicker in the direction of the object’s motion. 


These characteristics distinguish G objects clearly from normal stars. 
In general, the G objects seem to have very red K- L colours (K’-L’>5.2, 
6 and 4.5 for G1, G2 and G3, respectively’’), indicating that they are 
probably enshrouded by dust. 

There are also some differences and peculiarities: G3, G4, GS and 
G6 are all brighter in Bry than G1 and G2 by about a factor of 2. G1, G2 
and G3 have a clear L-band counterpart, unlike G4, G5 and G6. G3, G4, 
G5 and G6 show [Fe III] emission, whereas Gland G2 donot. G3 and G4 
are unresolved, while G5 and Gé are slightly extended (Fig. 3 inset). G1 
was extended after periapse’, as G2 was before and after periapse (but 
reverted to being compact!”). Despite these differences, the shared 
properties of the G sources warrant their aggregation into a new class 
with an appreciable population. 

G2 was originally interpreted as an ionized gas cloud? and later it 
was argued that G1 and G2 were knots within a common orbiting fila- 
ment”. However, this interpretation cannot apply to the newsources as 
they have completely different orbits. Gland G2 have remained intact 
after passing through periapse and, whereas G2 clearly underwent 
tidal interaction during its periapse passage”, its dust component 
has remained unresolved. This has led several authors®”*>~ to suggest 
that there might be astellar core shielded by an extended envelope of 
gas and dust. The star needs to have a relatively low mass (less than a 
few solar masses”) in order to be compatible with the weakness of the 
stellar continuum. 

Several models (Methods section ‘G-object formation scenarios’) 
have been proposed to account for G2 in terms of an optically thick 
distribution of dust surrounding a star: a young, low-mass star 
(T Tauri star) that has retained a protoplanetary disk” or that generates 
amass-loss envelope”, or the merger of a binary system®””>’8, 

The binary merger hypothesis (in which the influence of the black 
hole enhances the probability of a merger through eccentricity oscilla- 
tions”’) can also account for the presence of a population of G objects 
by interpreting them as relatively long-lived, distended post-merger 
objects. Assuming the binary merger hypothesis, we have used the 
number of observed G objects to estimate the required binary fraction?” 
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Allorbits have different inclinations, eccentricities and periods. b, Contour 
plots of intensity (in the Kn3 band) of the new G objects in 2018, their orbits 
(dashed grey lines) and the point-spread function (PSF). All G objects are 
unresolved or only marginally resolved. 


in the central 0.1 pc, obtaining a lower limit of about 5% for low-mass 
stars (Methods section ‘Binary fraction estimate’). This is compatible 
with the expected binary fraction*!, based on dynamical simulations”° 
and taking into account the physical characteristics of the Galactic 
Centre. In the most likely scenario” for the merger hypothesis, the 
original binaries would have been formed in the last major star forma- 
tion event at the Galactic Centre (4-6 Myr ago”’). 

Therefore, the binary merger hypothesis offers a compelling expla- 
nation for the origin of the population of G objects for several reasons: 
(1) it fits well with the three-body dynamics that are necessarily at play 
ina dense stellar environment; (2) it is compatible with the observed 
wide range of G-object eccentricities”’; and (3) it fits well with the known 
star formation history and observed stellar population. 

The random distribution of the orbital planes and the broad 
range of eccentricities of the G objects very closely resemble the char- 
acteristics of the orbits of the S stars, which more or less occupy the 
same volume. In all of the star-centred hypotheses for the G objects, 
the stellar object must have a relatively small mass (less than a few solar 
masses). At present, in the central parsec, we can directly detect stars 
with masses downto -1.5 solar masses”. Therefore, the G objects could 
be offering a unique window onthe low-mass, currently undetectable, 
part of the S-star cluster. 
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Methods 


Observations 

The observations were carried out with OSIRIS LGSAO covering 13 years, 
as tabulated in Extended Data Table 1. For each epoch of observations, 
the OSIRIS configuration with a plate-scale of 0.035” per lenslet was 
used with the Kn3 (2.121-2.229 tum) bandpass. A dither sequence with 
900s per integration using a square box pattern centred on Sgr A* with 
1.0 arcsec spacing was employed to increase the field of view and to help 
average-out systematic instrumental features. The data were reduced 
using the OSIRIS data reduction package, DRP™“. The DRP produces 
a wavelength-calibrated data-cube with two spatial dimensions and 
one spectral dimension, with the dither sequence median combined 
into a mosaic. 


Continuum subtraction 

In order to extract the emission line of the interstellar medium we need 
to remove the continuum emission coming from the numerous stars in 
the field. To doso, we selected several spectral ranges devoid of spectral 
features. These spectral ranges are the same for all epochs and they 
are chosen to optimize the continuum estimation across the field and 
across the spectral band. Afterwards, we model the continuum pixel-by- 
pixel using a spline function. The continuum subtraction is somewhat 
more complex at the edges of the filter’s band but this does not affect 
our measurements: the continuum around the emission line closest to 
the edge of the band that we are considering, [Fe 111] 2.2184 pm, is still 
well modelled. We then produce new data-cubes in which the modelled 
continuum has been subtracted from each spectrum and use those for 
the rest of the analysis. 


L’ detection analysis 

The Galactic Center Group has gathered L’ (at 3.8 pm) imaging data 
in the L’ bandpass (at 3.8 pm) with the NIRC2 imager at the W. M. Keck 
Observatory over several of the same epochs observed by OSIRIS and 
used in this study. These data were analysed to determine whether 
there are L’ sources coincident with the OSIRIS-detected Bry sources 
via the PSF-fitting tool StarFinder”. We chose the deepest L’ epoch 
(2012.551”) to search for coincident L’ sources. 

No L’ counterpart was detected for G4, G5 and G6 and we perform 
star-planting simulations to determine an upper flux limit. We used the 
Bry positions of the sources and transformed them into the 2012.551 
L’ coordinate system using a series of linear transformations that take 
into account stretching, linear offsets, and rotation. For each source, 
neighbouring L’ sources were subtracted out using the flux values 
identified with StarFinder. K’-identified sources that were not associ- 
ated with the L’ sources based on proper motions were also subtracted 
from the analysis image assuming that they had the same magni- 
tude and colour profiles as our flux calibration sources (SO-2, SO-12, 
S1-20 and S1-1°”). The images were then background-subtracted and 
Lucy-Richardson deconvolved using the background map and model 
PSF generated from StarFinder. We deconvolved for 8,196 iterations 
and re-convolved each image with a 3-pixel full-width at half-maximum 
(FWHM) two-dimensional Gaussian PSF. Point sources of varying magni- 
tude were planted in the image at the positions of G4, G5 and Géat vary- 
ing magnitude until they could no longer be detected with a modified 
version of StarFinder?’’, These magnitudes were then corrected for 
Galactic Centre extinction* and converted to flux densities. The L’ flux 
density values for G4, G5 and G6 represent upper limits, but the G5 value 
may still be contaminated by structured background in that region. 
The flux density values for G3 are consistent with previous reports”. 
All flux densities are reported in Extended Data Table 4. In all the above 
analyses, the single PSF model generated by StarFinder is adequate to 
use in this case as the off-axis positions of the candidate G sources do 
not experience a strong effect of the field-dependent PSF. A by-eye 
search for G4, G5 and G6 was performed using the L’ data coincident 


with the other OSIRIS epochs, but no sources were cleanly identified as 
being associated with the three candidate G sources. All deconvolved 
images in the L’-coincident epochs are shown in Extended Data Fig. 2. 


Aligning OSIRIS epochs 

For our measurements and analysis, we used 24 epochs of OSIRIS obser- 
vations. Each epoch consists of a mosaic constructed from frames that 
have been observed while dithering around the position of the star SO-2. 
The mosaic is obtained through a median-combine procedure applied 
through the OSIRIS DRP™®. In order to extract the astrometry of theG 
objects, we shifted all mosaics into acommon reference frame. To do 
so we measured the position of two reference stars: SO-12 and SO-14. 
The choice of these two specific stars was made because they are rea- 
sonably well-isolated in this crowded field, they are reasonably bright 
(for SO-12 K = 14.3, and for SO-14 K ~ 13.7; ref. ””), and they are close to 
the observed G objects, thereby minimizing possible systematics in the 
alignment procedure due to distortion. We have accurate knowledge of 
the orbital motions—and thus astrometric positions—of these two stars 
with respect to Ser A* from previous publications”. Taking into account 
the reference stars’ motions we can put all observations inacommon 
reference frame with Sgr A* at the centre. However, this assumes there 
is no significant differential distortion from epoch to epoch and that 
Sgr A* does not move. Given the small field of view covered by OSIRIS 
at this platescale, any differential distortion should be insignificant. 
The main source of uncertainty in this procedure comes from the cen- 
troid of the G objects. On the other hand, the position of the reference 
stars is very well measured because of the very high signal-to-noise. We 
also consider an additional systematic uncertainty onthe astrometric 
position in the orbital fit (Methods section ‘Orbit fitting’). 


Astrometric measurements and uncertainty 

Analysis of the proper motion of the G objects was performed using two 
sets of cubes: those that had been processed to remove the continuum 
(Methods section ‘Continuum subtraction’) and those with the stellar 
continuum included. The G sources do not have a continuum detec- 
tion in the Kn3 bandpass and thus we used the continuum-subtracted 
cubes to measure their positions. The positions of the G objects were 
measured ina median-collapsed 2D image produced by combining five 
spectral channels centred on the peak wavelength of the Bry emission 
from eachG source for each epoch. The peak-fit IDL routine was used to 
measure the X-Y position in each cube. The X-Y positions were trans- 
formed into RA-dec. coordinates relative to Sgr A* using the positions 
for SO-12 and SO-14 to establish the frame of reference. 

S0-12 and SO-14 are stellar sources with well-established position off- 
sets from Sgr A*, they are relatively isolated spatially, and their motion 
onthe plane of the sky is relatively small over this time frame. The stellar 
positions were measured using the IDL peak-fit routine of a median- 
combined 2D image produced from collapsing the spectral dimension 
of the cube over the range of 2.133-2.158 um (corresponding to chan- 
nels 50-150). This wavelength range was chosen because it is a clean 
part of the spectrum that avoids emission, stellar absorption and 
atmospheric absorption features. The stellar point sources were 
mapped to acoordinate system in which Sgr A*is at rest*°”’. The errors 
of the position measurements were estimated using a Monte Carlo 
method with many trials of centroid measures over variable aperture 
size and position. The measurements are reported in Extended Data 
Table 2. 


Radial velocity measurements and uncertainty 

The spectra for each G object were extracted from the continuum- 
subtracted data-cubes. To extract a 1D spectrum for the purpose of 
measuring radial velocity, the intensity of each Bry emission feature 
was measured at each spectral channel of the data-cube, summing 
over a1.5-pixel-radius aperture centred on the peak position of the 
emission feature. 
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The emission-line profile was analysed using a Gaussian fitting rou- 
tine on the emission feature. The fits were performed ona wavelength 
range that isolates the feature under study as muchas possible from 
other nearby emission features, such as the ambient gas and other 
G objects. The Gaussian parameter fit yields the central wavelength of 
the Bry emission line, from which the radial velocity can be calculated 
relative to the local standard of rest. 

Extended Data Fig. 3 displays the extracted spectrum and Gaussian 
fit for each object as it progresses over time. Changes in radial veloc- 
ity over the 13-year period are evident for each G object. The velocity 
measurement errors were computed using the statistical errors of the 
Gaussian fit. In addition to the detection of Bry emission, G3, G4, GS 
and G6 display [Fe 111] emission at the same Doppler shift, as shownin 
Extended Data Fig. 4, which displays the full Kn3 bandpass spectra from 
2006-combined data sets for G3, G4, G5 and G6 (as well as Gland G2 for 
comparison). G3, G4, G5 and G6 have [Fe 111] detections at 2.2184 um 
and 2.1457 pm (ref. °), whereas Gland G2 show only Bry emission. None 
of the G objects shows H, emission (2.1220 1m), although H, is evident 
in the ambient background material near zero velocity. The measure- 
ments are reported in Extended Data Table 2. 


Orbit fitting 

The astrometric and radial velocity measurements (Extended Data 
Table 2) are combined in a global orbit fit. The software used for orbit 
fitting has been previously used for the detection of the relativistic 
redshift on SO-2". The orbital modelling assumes Keplerian motion 
parameterized by the six following orbital elements”: the period (P), 
the time of closest approach (7,), the eccentricity (e), the inclination 
(i), the argument of periastron (@) and the longitude of the ascending 
node (Q). The G objects do not have enough orbital coverage and 
information to constrain the parameters related to the central mass 
(the mass of the black hole, the distance to our Galactic Centre Ro, 
the position and velocity of the central mass). Therefore, we fixed the 
values of the black hole mass and R, to the ones obtained from SO-2’s 
measurements", that is, to M = 3.964 x 10°M, (where M, is the solar 
mass) and Ry = 7.971 kpc. 

Our orbital fits are performed using Bayesian inference with a Multi- 
Nest sampler*®”. The radial velocity measurements are assumed to 
be independent and normally distributed. To take into account pos- 
sible systematics at the level of the orbital fit, we use a likelihood 
that includes a systematic uncertainty (gy) for the radial velocities. 
Insummary, the radial velocity (RV) measurements are assumed to be 
distributed following: 


RV;-NRV(t), oy, + oRv | 


where RV((¢) are the predicted radial velocity values, ogy are the meas- 
urement uncertainties and where x~N[p, 07] denotes that x is 
normally distributed around p with a variance of o”. On the other hand, 
the astrometric measurements are assumed to be correlated, that is, 
the likelihood is assumed to be a multivariate normal distribution char- 
acterized by a covariance matrix. In addition, to take into account pos- 
sible systematics at the level of the orbital fit, we also include an 
additional parameter: a systematic uncertainty for the astrometry, 
Oastro: Lhe astrometric measurements are therefore assumed to be 
distributed as: 


xX~ NEx(tastro), xy] and y~ N[V(tastro), Zy] 


where X(tastro) ANd Y(tastro) are the predicted astrometric values, J, and 
2, are the covariance matrices, and x ~ N[p, 2] denotes that the vector 
xis normally distributed around the vector pt witha covariance matrix 
of . We model the covariance matrices" by: 


= 2 2 2 2. 
Ey]y = [ply sf Oa + Oastro [%, + Oastro 


= 2 2 2 2 
yy = [ply,|03, + Oastro [0% + Oastro 


where O,¢ is the systematic uncertainty and pis the correlation matrix 
that characterizes the correlation of the measurement errors. This 
correlation matrix is given by”: 


[ply = (1-06; + cel au" 


where 6, is the Kronecker delta and d,is the 2D projected distance 
between point iand point/: 


d= (4-4) + 0, -y)? 


Here A is a correlation length scale that typically takes the value of 
half the diffraction limit of the detector", and is fixed here at a value 
of 35 mas; cis a mixing parameter that is fitted simultaneously with 
all parameters and that characterizes the strength of the correlation. 
Corner plots of the best fit are shown in Extended Data Fig. 5 and the 
best fit parameters are reported in Extended Data Table 3. 

In addition, we use uniform priors onall fitted parameters. We show 
in the next Methods section that this does not bias our estimates. 


Dependence on priors 

To estimate the orbits of the G objects, we use uniform priors onall eight 
fitted parameters (six orbital parameters and two systematic uncer- 
tainty parameters). Although uniform priors are commonly assumed 
in orbit fitting, this choice has been shown to cause potential biases in 
estimated parameters when orbital periods are much longer than the 
time baseline of observations‘®*!. To assess the impact of our fitting 
procedure inthis context, we ran simulations to assess possible biases 
in the estimated parameters and to test the accuracy of confidence 
intervals obtained in our analysis“. We generated 100 mock data sets 
with simulated measurements at epochs corresponding to our obser- 
vations. The simulated measurements were randomly drawn froma 
normal distribution about an assumed ‘true’ value, with a dispersion 
equal to the true measurement error at that epoch. We fit each of these 
100 mock data sets with the same orbit fitting procedure as described 
above. The bias on each fitted parameter is computed from the differ- 
ence between the estimated parameter value and the input parameter 
value, normalized by the loconfidence interval on the corresponding 
parameter. For all eight fitted parameters, the distribution of bias values 
is centred around zero for G3, G4, G5 and Gé within the 68% confidence 
interval, indicating non-biased parameter estimates. 

In addition, we evaluate statistical efficiency to demonstrate that 
the confidence intervals used in this analysis are well-defined and 
have close to exact coverage. According to the classical definition of 
aconfidence interval, loconfidence intervals inferred from each orbit 
fit should cover the ‘true’ value (from the simulated data) 68% of the 
time. In other words, given 100 randomly drawn simulated data sets, a 
68% confidence interval requires that about 68 out of 100 fits produce 
a confidence interval that covers the true value”. However, effective 
coverage (defined as the experimentally determined percentage of data 
sets in which the inferred confidence interval covers the true value) 
is rarely exact. Statistical efficiency, defined as the ratio of effective 
coverage to stated or expected coverage (for example, 68% for a locon- 
fidence interval), is thus a powerful performance diagnostic that can be 
used to investigate the accuracy of calculated confidence intervals*.. 
By definition, a statistical efficiency of one indicates exact coverage. 
The statistical efficiencies for all parameters for G3, G4 and G6 are 


consistent with one. For GS, the period is slightly under-covered with 
astatistical efficiency of 0.81+ 0.09, indicating that the inferred confi- 
dence interval on GS’s period is slightly underestimated. The statistical 
efficiencies for all other parameters for G5 are consistent with one. 
This analysis indicates that, in general, confidence intervals calculated 
in this work provide robust estimates of the statistical uncertainty. 


Flux calibration 

For this project we perform the absolute flux calibration of OSIRIS data. 
To doso, we need to apply aperture photometry to isolate sources of 
known magnitude. Even though many of the stellar sources are well- 
known, the Galactic Centre is a very crowded environment: no source 
is truly isolated and the combined background of underlying sources 
is challenging to determine. To measure the flux of stars on the field 
we would need to use a very small aperture radius. However, the PSF 
cannot be easily modelled, since the observations are taken through 
adaptive optics. Moreover, the OSIRIS field of viewis very small, making 
an accurate empirical knowledge of the PSF impossible. 

Instead we use observations of standard A stars, obtained the 
same night as the Galactic Centre observations. We used: HD 155379, 
HD 195500 and HD 146606 with 2MASS K magnitudes of 6.52, 7.19 and 
7.04 respectively. These stars are chosen to be at around the same air- 
mass as the science targets and their observations are taken as close 
in time as possible to the science observations (within a few hours). 
These are well-known, bright and isolated sources for which we can 
use aperture photometry over a very large radius that encompasses 
almost all of the source. In this way we can gather close to 100% of the 
flux and avoid problems related to the PSF shape. 

The A-star frames are obtained by dithering around the star’s posi- 
tion and are treated with the standard calibration procedure to remove 
atmospheric effects. Here, for each epoch, we use all available frames 
independently to measure the counts-to-Jy conversion factor and use 
their dispersion to estimate the corresponding uncertainty. Both the 
science mosaic and the A-star frames are calibrated in the standard way 
of the group. For each epoch, for each frame, we perform a 2D Gaussian 
fit to get the centroid of the source and an estimate of the Gaussian o. 
We extract the A-star flux (F,,) within a -12-pixel aperture radius, which 
is ~6 times the o of the 2D Gaussian fit (that is, that encompass ~100% 
of the stars’ flux). We subtract the sky background through an annulus 
1 pixel larger than the aperture size and of 1-pixel thickness (F,,,). We 
use the known magnitude of the star (from the 2MASS catalogue) to 
compute its expected flux in the Kn3 band (F,,,) using Vega as zero-point. 
The conversion factor is computed as follows: 


Ce 

Fap~ Fan df 

where dfis the width of the spectral channel in hertz. The same process 
is repeated for all frames within one given epoch and the median is 
adopted as the value for that epoch and the dispersion as the uncer- 
tainty. We checked other potential sources of error, such as imprecise 
pointing on the centre of the star, but we always obtained uncertain- 
ties several orders of magnitude smaller than the one coming from 
the dispersion. 

The disadvantage of not using sources within the science field for 
calibration is that there could be variations of the fraction of photons 
reaching the detector surface between the science target and calibra- 
tor observations—for example, because of variations inthe extinction 
due to passing clouds at the telescope site. However, the variation in 
extinction due to clouds is usually less than 0.5 mag and should have an 
impact smaller than the final calibration error. Indeed, the final calibra- 
tion factor does not vary much from night to night or even year to year. 
The most dramatic variations are related to instrumental hardware 
upgrades. Therefore, we have chosen to divide the OSIRIS instrument 


timeline into three parts”: (1) 2006-12 before the grating upgrade; 
(2) 2012-16 before the spectrograph upgrade; and (3) from 2017 on. 

For each of these periods, we consider the median of the conversion 
factors as the final value and the dispersion of the measurements as 
its uncertainty. This way we obtain three calibration factors with an 
error of about 10%. 

We also compare the conversion factor obtained with the A stars 
to the one obtained using multiple stars on the field. In the case of 
the field stars the values are very sensitive to the applied correction 
to the aperture flux, and the conversion factor therefore varies more 
dramatically (even within close epochs) than in the case of the A stars 
(Extended Data Fig. 6). Therefore, we can affirm that the flux calibra- 
tion obtained through standard stars is more robust. 


Flux measurements 

In order to maximize the signal-to-noise ratio, we measured the flux 
ondata-cubes combined year by year (hence1 cube per year). Multiple 
datasets were combined for each year using all available epochs to 
enhance the signal-to-noise ratio in the emission lines, resulting in 11 
data-cubes corresponding to 11 years of data taken between 2006 and 
2018 (except for 2007 and 2016, where the image quality was too poor 
and no Kn3 data cube was obtained, respectively). 

The Bry line fluxes of the G objects are obtained for each combined 
data-cube by extracting its spectrum and performing a Gaussian fit 
to the emission line. Flux measurements were derived from each line 
profile using an equivalent width method. The equivalent width was 
computed from the Gaussian fit parameters of the emission features 
from Bry and [Fe 111]. A conversion from measured flux to Wm was 
established using the flux calibration performed for each epoch from 
observations of A standards (Methods section ‘Flux calibration’). 
Note that the absolute flux calibration can have relatively high errors 
in AO systems where the image quality and encircled energy in the 
data collection can change substantially on short timescales and from 
night to night. The measured fluxes are dereddened*. The measure- 
ments are reported in the following Methods section and in Extended 
Data Table 4. 


Flux and FWHM summary table 

The measured flux densities for all objects are reported in Extended 
Data Table 4, along with measurements of the spatial and spectral 
width. We do not detect any continuum in the Kn3 band in any of the 
Gsources (we find a detection limit of 0.01 mJy). However, G2 detection 
in K-broadband imaging data has been claimed”, finding a dereddened 
flux of about 0.25 mJy in K, (2.18 pm central wavelength), which com- 
pares to a detection limit of 0.07 mJy in the K’ band” (2.12 um central 
wavelength). 


G-object formation scenarios 

Although many hypotheses have been proposed to explain the origin of 
Gland G2, the principal debate has centred on whether they are com- 
pact, dusty gas clouds or gaseous features anchored onstellar cores. G1 
and G2 were first interpreted as purely gas and dust clouds®. However, 
Gland G2 have remained intact after passing through periapse, which 
has led some®**” to argue that they must have a stellar core shielded 
by an extended opaque envelope of gas and dust. 

Given the absence of photospheric emission, the original G2 hypoth- 
esis interpreted it as an ionized gas cloud of 3 Earth masses. Since its 
discovery, the gas has been tidally interacting with the black hole. It was 
argued” that Gland G2 are knots of gas and dust that have formed 
within acommon orbiting filament. Indeed, their orbits are similar, 
but substantially different’. A drag force has been invoked to explain 
this difference’®. However, the common filament interpretation can- 
not apply to the new sources we present here because of their very 
different location and orbit. 
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Given the strong tidal forces near the black hole, and the high flux of 
ultraviolet radiation in this region, compact gas clouds would suppos- 
edly be transient phenomena, unless they could be stably confined bya 
high external pressure”. Otherwise, they would need to be continuously 
produced in order to account for the sizable population we observe. 
The region is richin gaseous interstellar medium structures, including 
the Epsilon source“ (a nearby feature immediately west of the field), 
the Minispiral*® and the Circumnuclear Disk“. It is possible that small 
pieces of these larger structures get detached and stay in the region 
for afew decades before getting destroyed, but itis not clear that such 
gas blobs would be as compact as the observed G sources. 

The alternative hypothesis is that the G objects host a star. Whereas 
G2 is tidally interacting during its closest approach to Sgr A*°, the 
dust component of G2 has remained unresolved. The emitting gas is 
unbound at closest approach”, but that is not inconsistent with the 
existence of a stellar mass keeping the dust emission compact®. 

Several models have been proposed to account for G2 in terms of 
an optically thick distribution of dust surrounding a star: a young, 
low-mass star (T Tauri star) that has retained a protoplanetary disk”® 
(scenario 1) or that generates a mass-loss envelope” (scenario 2); or 
alternatively, the merger of a binary system®”>”s (scenario 3). 

In the first scenario, G2 could be a young star that has retained a 
protoplanetary disk and that was scattered inwards from the mas- 
sive cluster of young stars distributed on larger scales”. Stars having 
protoplanetary disks are common in young clusters, but it is unclear 
whether such disks would survive the abrupt scattering event needed to 
transfer the protostars onto such tight orbits around the black hole. Fur- 
thermore, protoplanetary disks do not last very long except under the 
most benign conditions (up to 5-7 Myr; ref. *”), therefore a population 
of such objects in the particularly hostile Galactic Centre environment 
isnot obviously compatible with the timescale of the last star formation 
event (4-6 Myr ago”). Therefore, the protostellar disk hypothesis might 
be ruled out as an explanation for the common origin of these objects 
unless star formation is continuous at the Galactic Centre, as some 
have argued*®”’. This matter is still under debate, but any demonstra- 
tion that a substantial number of protoplanetary disks have survived 
in the central 0.05 pc of the Galactic Centre would have important 
implications for our understanding of star formation in this region. 

In the second scenario, G2 was proposed to be the product of the 
mass-losing envelope of a young, low mass, T Tauri star. One open 
question is whether the observed Bry emission is caused by collisions 
or ionization by Galactic Centre stars. In the case of emission by col- 
lisions the emission is unrelated to the G objects being located in the 
vicinity of the black hole, which raises the question of why these objects 
have not been seen elsewhere. 

In the third scenario, G1 and G2 are proposed to be binary merger 
products. The influence of the black hole will enhance the probability 
that binary systems merge through eccentricity oscillations due tothe 
eccentric Kozai-Lidov (EKL) mechanism”. The merging process would 
inflate the outer layers of the merging binaries, which would produce an 
extended envelope of dust and gas around the merger product, hiding 
the central mass for an extended period of time. A few binary mergers 
are knowninthe Galaxy” ©”. However, such mergers took place recently 
and were discovered because of the strong variability that probably 
characterizes the early stages of a merger. According to the merger 
hypothesis, the G objects are more likely to be ina much quieter long- 
term phase in which the merger has stabilized and is evolving slowly on 
a Kelvin-Helmholtz timescale. For this reason, it is not meaningful to 
compare the G objects to presently known mergers, especially because 
we still have scant quantitative knowledge of how a merger evolves. 

The binary merger hypothesis could offer a mechanism to rejuvenate 
stars in the Galactic Centre, as in the case of blue stragglers” (but see 
ref. °): some of the observed young stars orbiting closely around the 
central black hole (the S stars) could be the product of the merger of 
older stars. However, it is unclear whether this process can produce 


sufficiently massive stars to account for theS stars (typically (10O-30)M.; 
ref.”). The new star resulting from a merger can appear to be froma few 
Myr to several Gyr younger, depending onthe merging circumstances”. 

Even ifthe G objects cannot account for the origin of the S stars, they 
are possibly connected to them. Here we have shown that the orbits of 
G3, G4, G5 and G6 have very different inclinations. This random distri- 
bution of the orbital planes very closely resembles the distribution of 
the orbits of S stars. Ifa stellar object is hidden inside a G object it must 
have a relatively small mass (less than a few solar masses), given the 
weakness of the continuum emission from these objects. Inthe central 
parsec, given the kK’ detection limit”, we can detect stars with masses 
down to -1.5M, (ref. °°). However, we could detect low-mass binary 
systems that merge, producing a shell of dust and gas: gas would be 
ionized by the environmental radiation, whereas dust would be heated 
by both environmental radiation and the luminous energy emerging 
from the interior of the G object. The G objects could therefore offer 
aunique window on the low-mass, currently undetectable, part of the 
S-star cluster. 

As a consistency check, we investigated whether the number of 
observed G objects is consistent with the expected number of binary 
mergers (see the following Methods section). 

The EKL-induced binary merger hypothesis offers a compelling 
explanation for the origin of G objects that fits well with the three-body 
dynamics that are necessarily at play in a dense stellar environment, 
with the third body being a supermassive black hole. Moreover, a wide 
range of eccentricities is expected for such binary merger products”, 
in agreement with what we observe. 


Binary fraction estimate 

To estimate the binary fraction from the current number of G objects, 
we assume that all observed G objects are binary merger products 
(indeed we expect a large fraction of binaries in the Galactic Centre 
based on the orbital configuration of the stellar disk*”). We assume 
that all six of the G objects discussed here are relatively recent binary 
mergers, and that their progenitor binary systems were formed inthe 
latest known star formation event 4-6 Myr ago. This assumption is sup- 
ported by the fact that older binaries can only survive in the Galactic 
Centre ifthey are very tight, and therefore havea very low probability 
of merging®°. They would consequently not contribute substantially 
to the observed population of G objects. We use binary merger rates” 
and the initial mass function”. Given the absence of continuum emis- 
sion, we assume the G objects come from only the low-mass part of the 
population. Therefore, the binary fraction of low-mass stars is given by: 
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where N, is the number of binaries and N,, the number of low-mass 
stars. We should expect about 10% of all binary systems to have merged 
within a few million years froma given star formation event in the Galac- 
tic Centre®°. Also, 20%-25% of the initial binary population will have 
evaporated within the first few million years. So, given the observed 
number of G objects in the OSIRIS field of view (N, = 6), the number of 
binaries present today is given by: 


Ng= ae - 0.25-0.1) (2) 


The initial mass function inferred by Lu et al.”’ is: 
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where €is anormalization factor. Using this we can compute the number 
of low-mass stars (1M, <M<10M.), Nin: 


{7 Nn= Si me (3) 
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Given the number of stars we detect in the OSIRIS field of view (Ny, = 64 
stars with M>10M, is the average for 2018 data), we can determine the 
normalization factor and estimate the number of low-mass stars from 
equation (3): 


30M, Mie 
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From equations (1) and (2), it follows that the current binary fraction 
isR=5%. 

Locally in the Galaxy, the binary fraction of solar-type stars” is ~50%. 
Only about 20% of such binaries—which would be stable in the field— 
can be stable in the Galactic Centre”®, which leads to 10% solar-type 
star binaries in the Galactic Centre. 35% of these binaries have already 
evaporated after formation”, resulting in a surviving binary fraction 
of 6%-7%, compatible with what we deduce from the observed abun- 
dance of G objects. 


Data availability 


All data generated or analysed during this study are included in this 
published article. 


Code availability 


The orbit fit code is publicly available at https://zenodo.org/ 
record/3305315#.XXmLPC3MZUY. 
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Extended Data Fig. 1| Infrared-excess sources and G objects. Shown are the NACOat the VLT in 2005) and” in magenta (data obtained with NIRC2 at Keck in 
proper motions of the G sources (from 2006, in blue, to 2018, in yellow) along 2005). The red trace shows the proper motion of SO-2 (from NIRC2) as 


with the positions of infrared-excess sources’ in orange (data obtained with reference. The average OSIRIS field is displayed as a dotted rectangle. 


Extended Data Fig. 2| NewG objects in the L band. We show the deconvolved L’ images from NIRC2 for several epochs (each year is reported in the top-left 
corner). The green circles indicate the position of the G objects in the Kn3 OSIRIS band and of Sgr A*. 
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Extended Data Fig. 3 | Spectra of the new G objects showing the Bry emission 
line over time. Top row, G3 (left) and G4 (right); bottom row, G5 (left) and G6 
(right). The spectra are extracted epoch by epoch (black). The Gaussian fit of 
the Bry emission line (red) is superimposed. There is no significantly detected 
variation (all values are compatible within 10) in the linewidth for any of the 
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objects. The data quality varies and the emission of the objects blends with 
neighbouring features as it changes radial velocity (RV) and position: this gives 
sometimes the impression of a broadening of the line whichis not real. The 
emission line at the rest velocity is part of the extended emission present 
across the field and does not change with time. 
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Extended Data Fig. 4 | Spectra of the G objects showing Bry and [Fe 111] and G6 show both Bry and [Fe I11] emission moving at the same velocity 
emission lines. The Kn3 spectra of G objects G6-Glare extracted over an (Doppler-shifted emission indicated by the arrows), whereas G1 and G2 show 
aperture of 1.5-pixel radius from the 2006 combined dataset. The dotted lines only Bry emission. 


show the rest-frame velocity of the Bry and [Fe 111] emission lines. G3, G4, G5 
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Extended Data Fig. 5 | Corner plots for orbit fitting of G3, G4, G5 and G6. See Methods section ‘Orbit fitting’ for details of the parameters displayed. 
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Extended Data Fig. 6 | Unit conversion factors as a function of time. a, Factor hardware changes in the instrument. We use the median value for each 
obtained using calibration A stars (single frames in grey triangles, median for instrument period (green solid line) and use the dispersion as an estimate of 
each epochinred dots, dispersion used as error bar). b, Factor obtained using the uncertainty (green dashed lines). See Methods section ‘Flux calibration’ for 
stars inthe science field. The dispersion when using stars in the science field is details. 

much larger. We use the A stars, for which most of the variation corresponds to 
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Extended Data Table 1| OSIRIS observations used for the orbital fit 


Date Used for orbit fit # of frames FWHM [mas] LSR corr [km/s] 
2006-06-18 G3 G4 G5 G6 9 65 10 
2006-06-30 G3 G4 G5 G6 9 59 4 
2006-07-01 G3 - G5 G6 9 64 4 
2008-07-25 G3 G4 G5 G6 11 60 -7 
2009-05-05 G3 G4 G5 - 12 60 30 
2009-05-06 G3 G4 G5 G6 12 69 30 
2010-05-05 G3 G4 G5 G6 6 67 30 
2010-05-08 G3 - - G6 11 69 29 
2011-07-10 G3 G4 G5 G6 6 71 0 
2012-07-22 G3 G4 G5 G6 9 81 -6 
2013-05-14 G3 G4 G5 G6 11 91 27 
2013-07-27 G3 - G5 G6 11 66 -8 
2014-07-03 G3 G4 G5 G6 9 66 3 
2015-07-21 G3 - - G6 7 56 5 
2017-05-17 G3 G4 G5 G6 11 73 25 
2017-05-18 -G4-G6 9 94 25 
2017-05-19 -G4-G6 6 86 24 
2017-07-19 G3 - - G6 12 77 -4 
2017-08-14 G3 - G5 G6 8 75 -14 
2018-04-24 G3 - G5 G6 i 73 34 
2018-05-23 - G4 G5 G6 14 91 23 
2018-07-22 -G4-G6 11 77 -6 
2018-07-31 G3 G4 G5 G6 11 73 -9 
2018-08-11 G3 G4 G5 G6 9 79 -13 


The date is reported in YYYY-MM-DD format. All frames had integration times of 900 s. The FWHM value reported is for the star SO-2. To obtain radial velocities in the local standard of rest (LSR) 
reference frame, each observed radial velocity is corrected for the Earth's rotation, its motion around the Sun, and the Sun’s peculiar motion with respect to the LSR. 


Extended Data Table 2 | Measured values of positions and radial velocities 


epoch RA err DEC err RV err epoch RA err DEC err RV err 
53904.4 0.487 0.007 -0.117 0.005 -594 2 53904.4 0.402 0.005 0.142 0.004 -293 1 
53916.3 0.492 0.009 -0.125 0.006 -600 2 53916.3 0.405 0.009 0.131 0.006 -291 2 
53917.3 0.492 0.009 -0.105 0.006 -592 2 54672.3 0.436 0.008 0.135 0.006 -304 1 
54672.3 0.469 0.009 -0.082 0.005 -555 3 54956.5 0.445 0.009 0.133 0.009 -310 1 
54956.5 0.455 0.004 -0.052 0.001 -536 2 54957.5 0.447 0.012 0.133 0.004 -313 1 
54957.5 0.458 0.004 -0.059 0.003 -531 2 §5321.5 0.458 0.012 0.126 0.006 -314 1 
55321.5 0.454 0.003 -0.035 0.003 -518 1 §5752.3 0.473 0.005 0.134 0.004 -330 1 
55324.5 0.454 0.008 -0.027 0.002 -512 2 56130.3 0.484 0.023 0.131 0.010 -334 2 
G3 55752.3 0.431 0.004 -0.019 0.010 -478 3 G4 56426.5 0.502 0.005 0.125 0.002 -345 1 
56130.3 0.425 0.005 0.002 0.004 -464 3 56841.3 0.517 0.009 0.131 0.005 -346 1 
56426.5 0.416 0.002 0.015 0.001 -450 2 57890.4 0.550 0.004 0.133 0.003 -360 1 
56500.3 0.414 0.004 0.014 0.002 -438 2 57891.5 0.554 0.012 0.128 0.029 -360 1 
56841.3 0.405 0.002 0.036 0.001 -419 1 57892.5 0.553 0.012 0.125 0.008 -360 1 
57224.3 0.393 0.009 0.062 0.005 -398 1 58261.4 0.581 0.004 0.132 0.003 -361 1 
57890.4 0.370 0.005 0.094 0.002 -324 3 58321.3 0.583 0.005 0.128 0.002 -368 2 
57953.3. 0.369 0.003 0.098 0.002 -312 2 58330.3 0.578 0.005 0.133 0.004 -363 2 
57979.2 0.362 0.005 0.102 0.003 -318 2 58341.3 0.575 0.006 0.132 0.003 -365 1 

58232.5 0.356 0.032 0.118 0.010 -298 3 epoch RA err DEC err RV err 

58330.3 0.351 0.005 0.131 0.003 -288 2 53904.4 0.19 0.03 -0.37 0.02 -210 

58341.3 0.349 0.009 0.120 0.005 -300 2 53916.3 0.18 0.04 -0.33 0.03 -207 2 
epoch RA] err DEC err RV err 53917.3 0.20 0.02 -0.35 0.01 -209 1 
53904.4 0.70 0.06 -0.38 0.05 267 2 54672.3 0.20 0.05 -0.36 0.03 -210 2 
53916.3 0.70 0.01 -0.37 0.01 281 2 54957.5 0.21 0.05 -0.37 0.02 -210 1 
53917.3 0.70 0.01 -0.36 0.01 276 1 55321.5 0.24 0.05 -0.36 0.03 -212 2 
54672.3 0.67 0.01 -0.35 0.01 296 2 55324.5 0.27 0.03 -0.37 0.02 -225 1 
54956.5 0.66 0.01 -0.34 0.00 297 1 55752.3 0.24 0.03 -0.37 0.01 -232 2 
54957.5 0.66 0.01 -0.34 0.00 304 1 56130.3 0.25 0.03 -0.38 0.01 -227 2 
55321.5 0.65 0.01 -0.33 0.00 311 1 56426.5 0.26 0.02 -0.38 0.01 -230 1 
55752.3 0.62 0.01 -0.33 0.01 311 1 56500.3 0.26 0.02 -0.38 0.01 -230 1 
G5 56130.3 0.63 0.01 -0.31 0.01 314 2 56841.3 0.27 0.01 -0.39 0.01 -229 1 
56426.5 0.61 0.00 = =-0.31 0.00 327 0 ob 57224.3 0.29 0.03 -0.39 0.02 -244 1 
56500.3 0.59 0.00 8 -0.30 0.00 329 1 57890.4 0.30 0.02 -0.40 0.01 -238 1 
56841.3 0.59 0.00 -0.30 0.00 331 0 57891.5 0.30 0.02 -0.41 0.01 -237 1 
57890.4 0.53 0.00 -0.28 0.00 352 0 57892.5 0.30 0.02 -0.41 0.01 -240 2 
57979.2 0.53 0.00 -0.28 0.00 354 1 57953.3 0.30 0.02 -0.40 0.01 -245 1 
58232.5 0.52 0.01 -0.27 0.00 360 1 57979.2 0.29 0.03 -0.40 0.01 -240 1 
58261.4 0.52 0.00 -0.27 0.00 358 1 58232.5 0.31 0.05 -0.40 0.02 -253 2 
58330.3 0.51 0.01 -0.26 0.00 363 2 58261.4 0.32 0.04 -0.41 0.02 -244 1 
58341.3 0.51 0.00 -0.27 0.00 358 2 58321.3 0.31 0.03 -0.40 0.02 -237 1 
58330.3 0.31 0.02 -0.40 0.01 -241 1 
58341.3 0.31 0.02 -0.40 0.01 -244 1 


Data are shown for G3, G4, G5 and G6. The epochs are reported as modified Julian date (MJD). The positions (RA and dec.) are offsets from Sgr A* in arcseconds. The position uncertainties are 
the standard deviation obtained through a Monte Carlo method. The radial velocities (RV) are in km s ‘and have been corrected for the local standard of reference (LSR). The reported radial 
velocity uncertainties (err.) are purely statistical (10 of the line fit). There is an additional systematic uncertainty that we fold into the orbit fit. 
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Extended Data Table 3 | Best fit orbital parameters 


P [yr] TO [yr] e i [deg] Ww Q Onses Orv ¢ 
G3 156122 202442 0.1140-08 52tts 261423 55tt8 6.7733 Totes 0.8943-08 
G4 8157199 2075*21 0.2840-07 95.4408 160738 g2*t7 5.5718 4tos 0.9673-98 
G5 13924888 206233 0.9808T 90.1434 8.837 62.9432 2.8215 4.9tht 0.424322 
G6 7324224 2211433 0.417933 83.6113 2427138 61.7158 2.4425 5103 0.9975-03 


We show the median and central 68% confidence interval of the best-fit orbital parameters for G3, G4, G5 and G6. See Methods section ‘Orbit fitting’ for nomenclature. 


Extended Data Table 4 | Emission from the G objects 


NIRC2 OSIRIS 
flux density FWHM S/N 

L’ [mJy] Bry [mJy] _ [Felll] [mJy] spectral [km/s] spatial ["] Bry __ [Felll] 
G3 2.5+0.5 1.1640.15 0.59+0.11 15645 0.1040.02 24 14 
G4 <0.38+0.08 1.1040.14 0.49+0.09 11742 0.1040.02 34 16 
G5 <0.57+0.11 0.9940.13 0.44+40.08 110+2 0.1240.02 17 16 
G6 <0.54+0.10 0.8240.12 0.3740.07 12745 0.13+40.02 28 19 
G1 0.60.05 0.48+0.15 <0.027+0.005 293420 0.10+0.04 8 - 
G2 2.1240.15 0.6540.19 <0.027+0.003 174414 0.08+0.04 13 * 


We report properties of this emission: flux densities, spectral and spatial widths (as FWHM), and signal-to-noise ratios (S/N). The L' flux densities come from NIRC2 2012 measurements (G1 is 
brighter in earlier epochs”). The total flux density for Bry and [Fe 11!] (2.2184 um) come from OSIRIS (average of all observing). For comparison, we report G1 and G2 measurements from 2006 
where all G objects are detectable in OSIRIS data. All fluxes are dereddened™. The spectral and spatial FWHM (an average of the x- and y-FWHM) are measured for Bry. These values are not 
corrected for instrumental line width and PSF size (respectively ~100 kms" and ~75 mas). The S/N of the other [Fe ti!] line has a value of ~6 for all objects. 
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Classification is an important task at which both biological and artificial neural 


networks excel’”. In machine learning, nonlinear projection into a high-dimensional 
feature space can make data linearly separable*“, simplifying the classification of 
complex features. Such nonlinear projections are computationally expensive in 
conventional computers. A promising approach is to exploit physical materials 
systems that perform this nonlinear projection intrinsically, because of their high 
computational density”, inherent parallelism and energy efficiency®’”. However, 
existing approaches either rely on the systems’ time dynamics, which requires 
sequential data processing and therefore hinders parallel computation*°*, or employ 
large materials systems that are difficult to scale up’. Here we use a parallel, nanoscale 
approach inspired by filters in the brain’ and artificial neural networks? to perform 
nonlinear classification and feature extraction. We exploit the nonlinearity of hopping 
conduction? “ through an electrically tunable network of boron dopant atoms in 
silicon, reconfiguring the network through artificial evolution to realize different 
computational functions. We first solve the canonical two-input binary classification 
problem, realizing all Boolean logic gates” up to room temperature, demonstrating 
nonlinear classification with the nanomaterial system. We then evolve our dopant 
network to realize feature filters? that can perform four-input binary classification on 
the Modified National Institute of Standards and Technology handwritten digit 
database. Implementation of our material-based filters substantially improves the 
classification accuracy over that of a linear classifier directly applied to the original 
data’™’. Our results establish a paradigm of silicon-based electronics for small- 
footprint and energy-efficient computation”. 


Doping is acrucial process in semiconductor electronics, where impu- 
rity atoms are introduced to modulate the charge carrier concentration. 
Conventional semiconductor devices operate in the band regime of 
charge transport, where the delocalization of the charge carriers gives 
rise to high mobility and a linear response to an applied electric field. 
At sufficiently low doping concentration and temperature”, however, 
delocalization is lost, and carriers move sequentially from dopant atom 
to dopant atom. This is referred to as the hopping regime!°™*, which 
exhibits higher resistivity and nonlinearity. Nonlinearity is often unde- 
sired, but it is a valuable asset for unconventional computing, that is, 
for systems that do not follow the Turing model of computation® *””. 
Rather than excluding nonlinearity, we can exploit it” and manipulate 
our physical system with artificial evolution to solve computational 
problems”. This evolution in materio has been used, for example, for 
frequency distinguishing by liquid crystals’* and robot control with 
carbon nanotubes”. We recently showed that a disordered network of 
gold nanoparticles acting as single-electron transistors can be evolved 


into any Boolean logic gate at sub-kelvin temperatures”. By exploiting 
the physics of materials for computation at the nanoscale through 
evolution, we may realize systems with unprecedented computational 
density and efficiency that are too complex to design”®. 

Here, we fundamentally advance our previous work” by expanding 
the functionality, exploiting the well established platform of silicon 
technology and demonstrating operation up to room temperature. 
According to Cover’s theorem‘, complex, linearly inseparable classi- 
fication problems, when nonlinearly and sparsely mapped toa higher- 
dimensional space, can transform into linearly separable problems. 
The essence of this nonlinear mapping is illustrated in Fig. 1a for the 
XOR classification problem. To save resources, this projectionis often 
done implicitly by using kernel functions in machine learning, that is, 
without explicit computation of high-dimensional coordinates’. In 
artificial neural networks (ANNs), the nonlinear projection is learned 
by adjusting internal weights, traditionally through back-propagation, 
leading to powerful feature extractors”. However, emulating ANNs with 
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Enschede, The Netherlands. “Molecular Materials and Nanosystems and Center for Computational Energy Research, Department of Applied Physics, Eindhoven University of Technology, 
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Fig. 1| Simplifying classification by nonlinear projection. a, Inthe XOR 
classification problem two classes of data (red circles for (1,0), (0,1) and blue 
squares for (0,0), (1,1)) cannot be linearly separated in two dimensions (x,,.x,; 
left). When nonlinearly transformed to three dimensions (@,, @2, @3; middle), 
the data can be linearly separated according to their distances d (right) toa 
decision boundary (yellow plane in the middle panel). b, Schematic 
representation of the potential landscape of the dopant network. Inthe 
hopping regime, the potentials of Ndopants (purple spheres) spana high- 
dimensional feature space. Yellow spheres represent charge carriers. The 
voltage-time (V-¢) diagrams on the left schematically show the voltage 
combinations applied to the input electrodes (red), affecting the potential 
landscape and projecting information nonlinearly to the feature space. Note 
the difference between the potential landscapes inthe top and bottom panels 
for different input voltages. The characteristics of the output current (yellow 
electrode) are tunable by the control voltages (grey electrodes). 


conventional complementary metal—oxide-semiconductor (CMOS) 
technology is known to be power-inefficient”, and CMOS scaling is 
not keeping pace with ANNs“. To avoid the area and power costs of 
emulating neurons and synapses, reconfigurable? material systems 
with intrinsic complexity and diversity of nonlinear operations®**? 
are strongly sought after. 

Our system consists of a disordered network of boron dopants in sili- 
con (Si:B) and is illustrated in Fig. 2a, b. The boron atoms were implanted 
in n-type silicon with a concentration of 2 x 10 cm at the surface 
(Methods, Extended Data Fig. 1). A300-nm-diameter active region 
was defined by eight electrodes. The central silicon region was etched 
(about 80 nm deep) so that the boron concentration at the receded sur- 
face was reduced to about 5 x 10” cm, as confirmed by secondary-ion 
mass spectroscopy. The current-voltage (/-V) characteristics (Fig. 2c, 
Extended Data Fig. 2) become increasingly nonlinear with decreasing 
T, and can be modelled as electric-field-activated hopping conduc- 
tion at low temperatures (Supplementary Notes 1, 2). The network’s 
potential landscape (Fig. 1b) depends ina highly nonlinear way onthe 
input and control voltages, and spans a high-dimensional space. 
The output current is determined by this complex potential landscape. 
The nonlinear projection is realized when a combination of two or more 
input voltages is converted to an output current. 
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To identify the charge transport regimes, we focus on the low-bias 
conductance” G = diy/dVeply,, --10 mv, Where /p is the drain current and 
Vep is the source-drain voltage: 


G(T) = Gye2o"87) + Ge TW?” (1) 


where the first term describes band (b) conduction and the second 
term describes hopping (h) conduction. G,andG, are pre-factors with 
a much weaker temperature (7) dependence than the exponential 
terms, &, is the dopant ionization energy, 7;, is a characteristic tem- 
perature of hopping conductionand k, is the Boltzmann constant. The 
exponent p depends on the specific hopping model”. The resistance 
R=1/Gasa function of inverse temperature 1/T is shown in Fig. 2d. At 
T> 250K, hole-band conduction dominates. The extracted ¢, is about 
130 meV, three times larger than the value of boron in bulk silicon, 
about 45 meV. We attribute this increased ionization energy to dopant 
deactivation™”: for hydrogen-like dopants near the silicon surface, 
the decreased dielectric screening leads to stronger electron confine- 
ment, and therefore a larger ionization energy. 

We adopt the method proposed by Zabrodskii et al.” to distinguish 
the hopping regime and extract p (Methods). For 70-160 K, we find 
p=0.342 + 0.023, in agreement with p = 1/3 predicted for two-dimen- 
sional Mott variable-range hopping (Mott-VRH)””* (Fig. 2e). The two- 
dimensional nature implies that the dopants participating in transport 
are located close to the silicon surface, because the hopping resistance 
increases exponentially with inter-dopant distance”, which is low- 
est near the surface. This is consistent with the dopant deactivation 
observed in the band-conduction regime. Above about 160 K, band con- 
duction starts to contribute, becoming dominant above about 250 K. 

To demonstrate classification in the hopping regime (Supplementary 
Notes 3-7), we followed the evolutionary approach of ref.” (Methods) 
and configured the system into Boolean logic (Fig. 3a—c, Extended Data 
Figs. 3-5) at 77 K. The working-temperature window for aset of control 
voltages (about 30 K) is approximately 15 times wider than in our previ- 
ous nanoparticle system” (about 2K). The retention period of the gates 
is over two months in liquid nitrogen, and the device characteristics 
remain virtually unchanged after thermal cycling, indicating the robust- 
ness of the dopant network. Boolean logic represents a prototypical 
two-input binary classification problem’, and the XOR classification 
problem is a poignant example of a single-layer perceptron’s inability 
to solve problems with linearly inseparable vectors”. Hence, solving 
the linearly inseparable X(N)OR problem demonstrates the system’s 
separation ability*””” (Fig. 1a). 

As realizing all Boolean logic gates with a standard ANN requires at 
least one hidden layer of two neurons? (corresponding to nine linear 
and three nonlinear operations), our dopant network can be considered 
to emulate at least such a neural network in hardware (Fig. 3d). Impor- 
tantly, the dopant network has only a300-nm-diameter footprint and 
an average power consumption of about 1 pW (Methods). Using estab- 
lished monolithically integrated readout circuits (Methods, Extended 
Data Fig. 6), the bandwidth of the readout circuitry can be increased 
from 40 Hzin our current setup to over 100 MHz. With optimization 
(Methods and Supplementary Note 8), the energy efficiency of the 
dopant network at 77 K is projected to exceed 100 tera-operations 
per second per watt (TOPs ‘W 7“, where OPis one typical linear operation 
of aneural network”’), one order of magnitude higher than a state-of- 
the-art customized CMOS neural network accelerator”? (Supplemen- 
tary Notes 8, 9, Extended Data Fig. 7b). 

To investigate the correlation between the functionality of our 
devices and the transport mechanism, we performed random searches 
with 10,000 sets of control voltages as a function of temperature. We 
define the total abundance A, representing the overall probability 
of finding Boolean logic, with two fitness Fthresholds for each logic 
gate” (Methods). For both fitness thresholds F > 1, 2, the total abun- 
dance drops to below 5% when band conduction sets in at around 


Fig. 2| Device structure and charge transport 
mechanism. a, Scanning electron microscope 
image, indicating the source (S) and drain (D) 
contacts for /-Vmeasurements. b, Schematic cross- 
section, illustrating the doping profile andthe p-n 
junction (yellow dashed line). c, /-V characteristics at 
different temperatures (7) showing nonlinear 
behaviour below about 250 K. d, Resistance R versus 
inverse temperature at V;, =-10 mV. Band transport 
is observed for 250-295 K (indicated by the red line in 


the main figure and the inset, which shows the high-T 
region). e, Logarithmic derivative of the low-bias 
conduction Gwith respect to 7. The linear segment 
for 70-160 K indicates hopping conduction (blue 


500 


line). Inset, semi-logarithmic plot of R versus 1/7“, 
indicating two-dimensional variable-range hopping 
for 70-160 K (blue line) with 7;,=7.7 x10*K, falling 
well within the range reported for Mott’s VRH 


7} model’®. 
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160 K (Fig. 3e). Hence, functionality is highly correlated to the hop- 
ping regime. 

Led by this correlation, we tried to increase the operating tempera- 
ture by suppressing band conduction. With increasing temperature, 
dopants near the p-njunction (Fig. 2b) are expected to be ionized first, 
as they are less subject to deactivation than dopants far away from 
the junction. By depleting the junction using a backgate, we indeed 
observe nonlinearity, and can evolve all six major logic gates at room 
temperature (Fig. 3f, Extended Data Fig. 8). The confirmed correlation 


© o24 AND 


between functionality and the charge transport mechanism can serve 
as a guiding tool towards robust functionality at room temperature. 
To demonstrate the ability of our device to perform more compli- 
cated classification tasks, we performed four-input binary classification 
inthe form of filtering 16 2 x 2 black (1) and white (0) pixel features, as 
shown inthe inset of Fig. 4a. The four pixel values are encoded as four 
input voltages to our dopant network, together with three control 
voltages and one output current. We use the three control voltages to 
evolve a single network into 16 different filters at 77 K. Each filter should 
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Fig. 3 | Evolution of Boolean logic. a, Schematic electrode configuration, 
indicating input voltages (V,,, Vj,2), control voltages (V..-V.;) and output 
current (/,,,.). b, Input waveforms. The logic O and Lare represented here by two 
different voltages, 0 Vand 0.5 V, respectively (see also Supplementary Note 6). 
c, Major Boolean logic gates at 77 K (experimental current values in red, desired 
output normalized to the experimental datain black). We reproduced all 
Boolean logic gates in seven devices. d, ANN with two hidden neurons (green 
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filled circles) emulated by the dopant network device. The ANN requires six 
(linear) weight multiplications, three (linear) summations and three 
(nonlinear) activations. e, Total abundance of logic gates (defined in Methods) 
as a function of temperature. The dashed line marks the onset of band 
conduction. The blue and red curves correspond to fitness thresholds of F>1 
(noise level) and F>2, respectively. f, XOR and XNOR gates evolved at room 
temperature with a backgate voltage of about 12 V. 
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Fig. 4 | Feature filtering and handwritten digit classification. a, Current 
response of one of the 16 filters. The 2 x 2 pixel black/white patterns (inset) are 
represented by ‘0000’, ‘0001’ ..., ‘1111’, with black (1) and white (0) mapped to 
input voltages 0.5 Vand-—0.5 V, respectively. The output current of this filter is 
maximal when the ‘1011’ patternis presented. Error bars represent the standard 
deviation of ten tests. b, Feature mapping for digit recognition. Specific filters 
are activated (bold dark squares) depending on the features presented tothem. 


make one of the 16 features distinguishable from all the others, whichis 
realized by evolving the dopant network such that it yields the maximal 
or minimal output current for that specific feature (Fig. 4a, Extended 
Data Fig. 9). If we feed a feature to a group of 16 filters, each of which 
distinguishes one feature, then the 4-dimensional data are mapped to 
a16-dimensional vector, and each feature vector is separated fromthe 
others in one of the dimensions (Supplementary Note 9). 

Our approach allows the separation of data by evolving filters that 
are capable of processing data in parallel and with high throughput. 
Compared with optical networks, which also allow parallel processing, 
our dopant networks feature tunability and have much smaller dimen- 
sions: about 100 nm instead of centimetres’. 

Taking advantage of the separation ability of our nanomaterial 
system, we used the evolved filters as the core ingredient to classify 
the Modified National Institute of Standards and Technology (MNIST) 
digits’. The whole classification procedure consists of a feature map- 
ping layer of the evolved filters inspired by the convolutional neural 
network’, followed by a linear classifier in a traditional computer, 
which can in principle also be realized in materio” (Fig. 4b). The 
28 x 28 greyscale pixels of each MNIST digit are converted to black 
and white using a threshold and divided into 2 x 2 pixel receptive fields 
(overlapping in one row/column with neighbours). The receptive 
fields feed their signal to the cluster of 16 filters, each filtering out 
one of the features. The (28 x 28)-dimensional MNIST data are hence 
mapped onto (27 x 27 x 16)-dimensional feature vectors. The linear 
classifier then converts these high-dimensional feature vectors to 
a10-dimensional output by a weight matrix My (Fig. 4b), obtained 
by pseudo-inverse learning” with the 60,000 MNIST training data 
(Methods). The largest of the ten outputs finally determines the rec- 
ognized digit. 
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For clarity, most of the 27 x 27 x 16 filters are not shown. The output of the filters 
is obtained from the experimental data shown inaand Extended Data Fig. 9. 
Theten output nodes, representing digits 0 to 9, are connected tothe filters 
through a weight matrix My ofa linear classifier. c, Confusion matrix of 
classification with the 10,000 MNIST test dataset, showing that 96.0% of the 
digits are correctly classified. 


Application to 10,000 test digits shows 96.0% accuracy (Fig. 4c, Sup- 
plementary Note 9, Extended Data Fig. 10), which is better than the 
accuracy obtained with state-of-the-art physical reservoir computing® 
and optical networks’. We note that differences in the output current 
scales of the different filters are irrelevant, because the weight matrix 
will automatically compensate for those (Supplementary Note 9). We 
also simulated feature filters with ideal characteristics, which are only 
activated when presented with its corresponding feature (output 1 
for target feature and O otherwise). The classification of the MNIST 
dataset with these ideal filters results in an accuracy of 96.2%. There- 
fore, as long as the data mapped to the feature space are sufficiently 
separated, a linear classifier can learn the decision boundaries. The 
underlying reason is that every complete set of independent vectors, 
be it orthogonal (ideal) or not, can represent other vectors by linear 
combination. This shows the power of our dopant network in making 
data linearly separable, owing to its intrinsic nonlinear transformation. 
The ability to separate data, when combined with an adaptable linear 
readout ina scaled-up system, can achieve universal computational 
power®”””3, For instance, in ANNs, perceptrons can be cascaded to 
solve more complex problems’. This analogy strongly suggests that a 
system of interconnected dopant networks can address a much wider 
range of tasks, particularly because the computational power of asingle 
dopant network is larger than that of a single perceptron (it can solve 
XNOR whereas a single perceptron cannot). 

At the system level, we anticipate a number of necessary develop- 
ments. First, the total evolution time of the filters, which scales linearly 
with their number, can be reduced (by a factor 10°; Methods and Sup- 
plementary Notes 7, 8). Besides competitive evolutionary approaches”, 
we willalso explore gradient-based methods”. Secon4d, it will be highly 
advantageous to store the evolved control voltages locally, employing, 


for example, memristors”° (Supplementary Note 8). Third, memristive 
technology is also suitable for in materio implementation of the linear 
classification step in our scheme with energy efficiency comparable to 
our material-based nonlinear feature filters. Fourthly, processing ana- 
logue instead of binary signals would be more natural for our devices. 
To filter more complex, non-binary features, such as edge detection 
by the brain’, more electrodes per device are needed and/or multiple 
devices need to be interconnected, so that more input signals can be 
processed in parallel. This will also allow for more control voltages per 
filter (at present, three) to improve the signal-to-noise ratio. Lastly, for 
practical applications, room-temperature operation with long reten- 
tion, low-voltage supplies and without a backgate is desired, which we 
deem possible by engineering the deactivation effect in a silicon-on- 
insulator-based system. 

Our silicon-based system provides a powerful platform for carry- 
ing out machine learning tasks in hardware. By material learning, we 
harness the intrinsic nonlinearity and tunability of ananomaterial 
system to efficiently realize functional tasks without the need to design 
circuitry for the underlying elementary operations. The small foot- 
print and silicon-compatible fabrication process facilitates scaling 
up for massively parallel, high-throughput information processing 
platforms for complex computational tasks. Whereas the random- 
ness and discreteness of dopants pose challenges on conventional 
silicon electronics, we have presented a computational paradigm that 
takes full advantage of these properties. When integrated with other 
technologies, complex classification problems can be solved fully in 
materio, potentially achieving ultrahigh computational density and 
energy efficiency". 
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Methods 


Samples 

300 nm of thermal oxide was grown on an n-type silicon substrate 
(Extended Data Fig. 1a), in which 26 x 60 pm’ implantation windows 
were defined by photolithography and wet etching. Another 35 nm 
of oxide was thermally grown in the implantation window to serve 
as a Stopping layer (Extended Data Fig. 1b). After boron implantation 
(9 keV equivalent, 3.5 x 10% cm”), and activation via rapid thermal 
annealing (1,050 °C, 7 s; Extended Data Fig. Ic), the 35-nm stopping 
layer was removed by wet etching. The boron concentration near the 
silicon surface exceeds 2 x 10° cm™*to ensure Ohmic contact with the 
electrodes, and decreases monotonically with depth (Extended Data 
Fig. 1h). After lift-off of the wire-bonding pads (1.5 nm Ti/40 nm Pd) 
defined by photolithography (Extended Data Fig. 1d), eight 1.5 nm 
Ti/40 nm Pd nanoelectrodes were patterned on top of the silicon by 
electron-beam lithography (Extended Data Fig. le). The devices were 
annealed at 160 °C for 10 min to promote the metal/silicon contact 
quality. The silicon surface was further etched by reactive ion etching 
to reduce the boron concentration in the active gap area (Extended 
Data Fig. 1f, g; see also Supplementary Note 6). The surface was finally 
treated with mild oxygen plasma, followed by 1% HF etching to remove 
possible contaminants. 


Charge transport 

Following Zabrodskii et al.”, we introduce the logarithmic derivative 
w=d(logG)/d(logT). From equation. (1), we see that if the hopping term 
G,e 7/”’dominates, logw~logp + p(log7,-log7), and p canbe derived 
from the slope of the logw-logT curve (Fig. 2e), thus allowing us to 
identify the exact hopping conduction model. For 7< 70 K, the meas- 
urement noise level prevents unambiguous identification of the charge 
transport mechanism (Fig. 2d), but probably VRH continues**. The 
charge transport behaviour described in the main text has been 
observed in the two devices we characterized. 


1.5 


Measurements 

We conducted the charge transport measurements and evolution of 
logic gates at different temperatures in a customized flow cryostat. 
The cryostat is equipped with 12 coaxial cables to reduce capacitive 
cross-talk. We use a battery-powered electronics rack (IVVI rack and 
matrix rack; http://qtwork.tudelft.nl) composed of digital-to-ana- 
logue converters (DACs) and //V converters for low-noise measure- 
ments (Extended Data Fig. 6). The output range of the DACs is from 
-2V to 2V. The //V converter has four amplification settings, 1 GQ, 
100 MQ, 10 MO and 1 MoO, each corresponding to a different meas- 
urement range. For measurements at cryogenic temperatures, 1 GO 
amplification is chosen as default, by which currents from —3.4 nA to 
3.4 nA can be measured. The output of the //V converter is sampled 
by a multimeter (Keithley 2000) or digitizer (ADwin-Gold II). For 
room-temperature evolution, the //V converter amplification is set 
to10 MQ, resulting ina current measurement range from -340 nA to 
340 nA. The measurements are automated with either LabVIEW or 
Python scripts. For fixed-temperature measurements, the devices 
were inserted into a liquid-helium (4.2 K) or liquid-nitrogen (77 K) 
dewar with a customized dipstick. 


Readout speed 

In our system, the relaxation time of hopping conduction is less than 
10 ns at 77 K and even smaller at higher temperatures (Supplementary 
Note 8), so itis not the dominant timescale in our present devices. Like 
in all measurements on resistive devices®, the readout speed of our 
dopant networks is constrained by a large capacitive load (Extended 
Data Fig. 6b). The long, twisted pairs (about 3 m) as wellas the filters of 
the matrix rack amount toa large load capacitance C, (about 4 nF) that 
limits the signal speed. With the existing setup, we have a bandwidth 


(cutoff frequency of the resistor—capacitor (RC) circuit in Extended 
Data Fig. 6b) 


1 


BW=.—— 
2mC,(RourllRiy) 


= 40 Hz 


where Ry = 1 MO is the input resistance of the //V converter at 1GQ 
amplification, and the dopant network output resistance R,,, is typi- 
cally hundreds of MO (Extended Data Fig. 7d). 

By monolithically integrating a transistor-based readout circuitry 
close to the dopant network® (Extended Data Fig. 6c), we canreduce the 
capacitive load for fast readout, and also enable interconnection with 
other devices. With existing CMOS technology, the load capacitance 
can be easily reduced to below 1 fF, and the RC-related bandwidth can 
reach 160 MHz, or even more, by reducing Ry. 

Given a signal intensity (the difference between high and low output 
current levels; see ‘Fitness functions’ below), the signal-to-noise ratio 
(SNR) is predominantly set by the Johnson-Nyquist noise from Ry, 
because its noise power is proportional to the bandwidth. Therefore, 
for a required SNR (computation precision), the bandwidth and the 
subsequent energy efficiency, are determined by the signal intensity 
(Supplementary Note 8). The signal intensity of our devices ranges from 
the order of 0.1 nA to the order of 1nA (Supplementary Note 6), thus 
allowing over 100 MHz bandwidth (Extended Data Fig. 7a). 


Fitness functions 

For Boolean logic gate evolution, the input sequences, representing 
the four input entries of truth tables (Fig. 3b), were fed to the input 
electrodes (Fig. 3a) after the control voltages were set. We monitored 
the output current waveform Y and fitted it with Y = mX + C, where X 
is the expected output waveform of a logic gate (logic high and low 
taking numerical values of 1 and 0, respectively). mis the proportion- 
ality factor and its value thus equals the separation of the high and low 
levels (signal intensity). Crepresents the offset. For each set of control 


voltages, a fitness is evaluated by F= mI (Jf + kc), with r., being the 
fitting residual” and k an empirical constant. A larger k puts more 
emphasis on minimizing the offset Cin the evolution process. For the 
evolution of logic gates, we found that there is minimal offset for k=0.2 
(Fig. 3c). Inthe random search of logic gates at different temperatures, 
khas been set to 0.01 to give the waveform shape more weight than 
the offset. Then, a fitness value of F=1implies that the signal intensity 
(related to m) almost equals the noise intensity (related to ales)» anda 
fitness value of F=2 corresponds to more robust logic gates. Based on 
the fitness, we define the abundance of each gate. For the 10,000 out- 
put waveforms from a random search, we assessed the fitness of each 
output waveform for six major logic gates. In this way, each logic gate 
is associated with 10,000 fitness values. The abundance of a gate A; 
(whereiis AND, OR, NAND, NOR, XOR, XNOR) is defined as the number 
of fitness values larger than a threshold, divided by 10,000. The total 
abundance is then defined as A = 1/); (1/A;). The fitness function for 
the feature filter evolution was defined as F= |I,u,//L[aVg(ourjeil) + Std (/ 
outjei)], where /,,,.;is the output current corresponding to feature/,, and 
avg and std stand for the average and standard deviation, respectively, 
of all the other feature outputs /,.,;., Here, iruns from 1 to 16. 


Genetic algorithm 

The genetic algorithm mimics natural evolution. An initial generation 
of 20 genomes, with the length of each genome equal to the number of 
control electrodes, is first randomly generated and mapped to control 
voltages. The fitnesses of the 20 genomes are evaluated and ranked. 
Then the off-spring generation of 20 genomes is produced in the fol- 
lowing way: (1) inheriting the five elite genomes (with highest fitnesses) 
from the previous generation; (2) cross-breeding of the elite genomes to 
produce five off-spring genomes; (3) mutation of the five elite genomes 


bya probability of 0.1, then cross-breeding with the five elite genomes 
to generate five other genomes; (4) cross-breeding of the five elite 
genomes with five random genomes to generate five other genomes. 
The genetic algorithm keeps iterating until it reaches a satisfactory 
fitness value (Extended Data Fig. 4a; see also Supplementary Note 7). 
Amore detailed description of the evolution procedure is given in our 
previous work”. 


Power consumption 

To estimate the power consumption and energy efficiency of our device, 
we measured the static power consumption of the six major Boolean 
logic gates for four different input voltage combinations, so in total 
24 configurations. To measure the current of the ith (irunning from1 
to 8) electrode, the voltage V; (current /,) is set (measured) by asource 
meter (Keithley 2401), while the voltages on the other electrodes are 
set by either the DACs (control voltages and input voltages) or an //V 
converter (output electrode). For each of the 24 configurations, the 


total power Pis calculated as P= phan V,;. The average power of the 24 
configurations is found to be about 1 pW. Under operational conditions, 
the voltage changes on the electrodes are accompanied by charging 
and discharging of wire capacitances. As mentioned above (‘Readout 
speed’ section), the capacitances can be reduced to below 1 fF, making 
the dynamical power consumption negligible compared with the static 
power consumption. The static power consumption could be substan- 
tially reduced by using electrostatic electrodes (see also Supplementary 
Note 8). 


Weight matrix training and test 
Inthe digit classification task, each 28 x 28 pixel digit is divided into 27 x27 
receptive fields of 2 x 2 pixels, overlapping by one row/column of pixels. 
The pixels of each receptive field are mapped to the 4 inputs of 16 filters 
(with their experimentally determined response), each of which 
filters 1 of the 16 distinctive 2 x 2 pixel features shown in the inset of 
Fig. 4a. For the ath digit in the N;=60,000 MNIST training database, we 
stack the N= 27 x 27 x 16 = 11,664 outputs of the filters in a feature vec- 
tor Oy = (Oy, «+, Og,y,)- Combining the vectors O, of 60,000 training 
digits together, we obtain an N,x N,output matrix O = (O,, ---, Oy)". 
The true label of each digit is represented by a ten-dimensional label 
vector L,, whose elements are all zeros except for the (/+ 1)th entry 
being 1, where /e {0, ..., 9} is the true label of the dth MNIST digit. 
Ideally, the weight matrix My, converts the feature vector of a digit to 


its corresponding label vector O,M,,=L,. So, in matrix form, OM, =L, 
whereL = (L,, ---, Ly,)'. The weight matrix My has a dimension of N;* 10, 
and is simply obtained by My = OL, where O* is the pseudoinverse of 
matrix O. Once the weight matrix is trained, we test it with the 
N,=10,000 MNIST test data. The feature vector of each test digit O,, 
(t=1,...,N,),is multiplied by the weight matrix to acquire the predicted 
label vector P,, OMy =P,. 

The index of the maximal element of P, minus one gives the predicted 
label. The accuracy is calculated as the ratio of the total counts of the 
correctly classified digits, that is, the sum of diagonal entries in Fig. 4c, 
to the total number of test digits N,. 


Data availability 


Data are available from the corresponding author upon reasonable 
request. 
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Extended Data Fig. 1| Fabrication steps and dopant concentration. 

a, Thermal oxidation. b, Implantation window definition and growth of 35nm 
oxide.c, lon implantation. d, Photolithography and contact pads lift-off. 

e, Electron-beam lithography and nanoelectrodes lift-off. f, Reactive ion 
etching (RIE) of silicon. g, Height profile of the metal electrodes with respect to 
silicon before (black) and after (red) RIE etching. The etch depth of siliconis 
estimated by measuring the height change of the metal electrodes with respect 
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tothe silicon surface (indicated by the black line on the atomic force 
microscopy image in the inset, not to scale). Assuming that the metal is not 
etched by RIE, the etch depth of silicon is around 83 nm. h, Secondary ion mass 
spectroscopy of the boron dopant depth profile after implantation. Onthe 
basis of the etch depth, the boron concentration near the recessed silicon 
surface is of the order of 5x10” cm?. 
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Extended Data Fig. 2| Nonlinear and tunable hopping conduction. a, /-V 
characteristics at 4.2 K with different total etching time by RIE. As the total 
etching time increases, the nonlinearity becomes increasingly prominent, 
signalling the dominance of hopping conduction. b, Drain current versus 
control voltage for constant source-drain voltage V;, =1.2 Vat 4.2K. The 
source (S), drain (D) and control (C) electrodes are shown in the inset. The 
hysteresis for negative gate voltage is probably due to charging of the other 
five floating electrodes. c, Schematic plot of electrochemical potential 
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versus positions, illustrating the tunability. The solid lines represent impurity 
states and the arrows represent hopping of carriers among states. See 
Supplementary Note 3 for detailed discussion. d, Fitting the temperature- 
dependent /-V curves with the model described by equation (2) in 
Supplementary Note 2. Black dashed lines represent the fitted curves. 

e, Conductance versus the reciprocal of the cube root of the source-drain 
voltage at different temperatures. The black circle groups data at temperatures 
below 140 K. See also Supplementary Note 2 for more discussion. 
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Extended Data Fig. 3 | Evolved logic gates at 77 K.a, Abundance plot of 14 non- 
trivial truth tables at 77 K. Fromasearch with 10,000 sets of randomly 
generated control voltages, we found all 16 possible truth tables that can be 
realized for atwo-input-one-output configuration. b, Thermal stability ofa 
NAND gate evolved at 77 K. Above 140K, the output current clipped to 
compliance, and therefore the fitness was not extracted. The error bars 
represent the standard deviation of ten tests (see also Supplementary Note 4). 
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c, Boolean logic gates evolved at 77 K ina device other than the one in Fig. 3c. 
Red circles are experimental output currents, and black lines represent the 
normalized desired output currents. The left six panels show the six major 
logic gates evolved with input voltage levels 0 Vand 0.5 V. The right two panels 
showa NAND anda XNOR gate evolved with input voltage levels of -0.25 Vand 
0.25 V, showing the adaptability of the dopant network to different voltage 
levels (see also Supplementary Note 6). 
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Extended Data Fig. 4 | Convergence of genetic algorithmin the 
configuration space. a, Genetic algorithm convergence for the six major 
Boolean logic gates at 77 K. The best fitness of the 20 genomes is plotted 
against generation. b, Histograms of the control voltages that configure the 
dopant network to the XNOR gates with fitness Flarger than1.5. The first 
control voltage is prominently concentrated ina small range, but the others do 
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not showa favourite range. The ranges of the five control voltages are (-600, 
600), (-1,200, 1,200), (-1,200, 1,200), (1,200, 1,200) and (-600, 600). 

c, Control voltages for the six major logic gates. d, Control voltages for the 16 
filters, which are visualized ine. The filters ‘0110’ and ‘0010’ have the smallest 
separation. See Supplementary Notes 3 and 7 for more discussion. 
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mechanism is still VRH (Methods). b, Evolved logic gates at 140 K. Red circles 
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Extended Data Fig. 6 | Measurement setup. a, Schematics of the existing feedback resistance of the //Vconverter, respectively. c, Schematic ofan 
measurement setup. b, Equivalent circuit of the current measurement setup. integrated high-speed current reading circuit. Here, Ry is aresistor to convert 


7,u,.and R,,, represent the output current and output resistance ofthedevice.C, | currentto voltage, C, is the parasitic capacitance that can be reduced to below 
is the parasitic capacitance of about 4 nF. Ry and R; are the input resistance and 1fF. Rois aresistor that sets the amplification. 
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Extended Data Fig. 7| Bandwidth and energy efficiency scaling. a, The computational technologies, the most energy efficient high-performance 
scaling of allowed bandwidth with signal intensity in a log-log plot. The back, computer’®, the neural network (NN) accelerator?’ and memristors” 
blue and red solid lines represent three different indicated cases. Larger (Supplementary Note 8). c, Current flow pattern of a NAND gate (NAND1O ind) 
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Extended Data Fig. 8 | Backgate-induced nonlinearity and evolved logic 
gates at room temperature. a, A positive voltage V;,,, with respect to the drain 
voltage is applied to the n-type substrate (Fig. 2b) to make the depletion region 
wider at the p-njunction, and to suppress the band conduction. b, Evolved 
gates at room temperature. Red circles are experimental outputs, and black 
lines represent the normalized desired outputs. The output current levels, and 
also the separation between these levels, are more than one order of magnitude 
larger than those of the logic gates evolved at 77 K, owing to the increased 
hopping conductance (Supplementary Note 3). The increased noise intensity is 
mainly due to the settings of the current measurement circuit (Methods). 
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Extended Data Fig. 9 | Experimental response of the16filters.Eachofthemis thetargeted feature is clearly separated from the other currents. Error bars 


evolved to filter the feature given in blue. The output currents corresponding represent the standard deviation obtained from ten tests. 
to features other than the desired one are not zero, but the output current of 
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Extended Data Fig. 10 | Enhancing robustness of the linear classifier against 
noise. Besides optimizing the SNR, the linear classifier’s tolerance to noise can 
also be increased by taking noise into account during the training phase. The 
accuracy remains over 92% at 0.05 nA noise amplitude (see Supplementary 
Note 8 for a detailed discussion). 
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Highly efficient neutron detectors are critical in many sectors, including national 
security’”, medicine’, crystallography‘ and astronomy”. The main neutron detection 
technologies currently used involve *He-gas-filled proportional counters* and light 
scintillators’ for thermalized neutrons. Semiconductors could provide the next 
generation of neutron detectors because their advantages could make them 
competitive with or superior to existing detectors. In particular, solids with a high 
concentration of high-neutron-capture nuclides (suchas °Li, !°B) could be used to 
develop smaller detectors with high intrinsic efficiencies. However, no promising 
materials have been reported so far for the construction of direct-conversion 
semiconductor detectors. Here we report on the semiconductor LilnP,Se, and 
demonstrate its potential as a candidate material for the direct detection of thermal 
neutrons at room temperature. This compound has a good thermal-neutron-capture 
cross-section, a suitable bandgap (2.06 electronvolts) and a favourable electronic 
band structure for efficient electron charge transport. We used a particles from an 
414m source as a proxy for the neutron-capture reaction and determined that the 
compact two-dimensional (2D) LilnP,Se, detectors resolved the full-energy peak with 
an energy resolution of 13.9 per cent. Direct neutron detection from a moderated Pu- 
Be source was achieved using °Li-enriched (95 per cent) LilInP,Se, detectors with full- 
peak resolution. We anticipate that these results will spark interest in this field and 
enable the replacement of "He counters by semiconductor-based neutron detectors. 


Direct neutron detection presents a tremendous challenge because 
neutrons interact weakly with most matter®. Neutron detectors exploit 
the properties of certain nuclides that decay into highly energetic 
charged fragments upon capture of a neutron, providing easily detect- 
able signals through ionization products’. In practice, only *He, °B and 
®Licombine high neutron-capture cross-sections with detectable decay 
products, and these isotopes have been incorporated into various 
detector architectures. The detector of choice for several decades has 
been the *He-gas-filled proportional counter. However, in the past two 
decades, the stockpile of *He has been greatly depleted with no viable 
means for increasing production to meet demand’. Therefore, many 
alternative technologies have attracted interest, including “BF,-filled 
tubes, B-lined tubes, °LiF-based scintillators, lithium-loaded plastic 
scintillators’,°LiF-filled micro-structured semiconductor detectors” 
and neutron-sensitive semiconductors. These technologies have advan- 
tages and disadvantages", but so far no widespread replacement for 
3He tubes has been found. 

®Li- and ’°B-containing semiconductors are emerging technolo- 
gies that promise highly efficient detectors because the concentra- 
tion of neutron-absorbing isotopes is much greater in solids. There 
are two classes of semiconductor neutron detectors: indirect- and 


direct-conversion detectors”. The thermal-neutron detection effi- 
ciency for indirect-conversion semiconductors reaches a maximum of 
about 40%”. Direct-conversion semiconductors use a single material 
for both neutron capture and charge collection, enabling a simpler 
detector geometry, with intrinsic thermal-neutron detection effi- 
ciencies approaching 100%. The semiconductors, however, must be 
extremely pure and with low carrier trapping because neutron fluxes 
can be very low, creating very small numbers of excited charges. Mate- 
rials used for direct conversion include LilnSe,"* ° and hexagonal-BN 
(h-BN)”, both of which suffer from severe drawbacks that limit their 
performance. Despite these challenges, we demonstrate here the devel- 
opment and outstanding neutron detection capabilities of the layered 
semiconductor LilnP,Se,. 

The selenophosphate compound LilnP.Se, is a 2D layered semicon- 
ductor that offers the requisite properties needed to achieve direct 
thermal neutron detection. Bulk LilnP,Se, was synthesized phase pure 
(see Extended Data Fig. la—c) viaa slightly off-stoichiometric solid-state 
reaction at 750 °C (see Methods). LilnP,Se, crystallizes in the trigonal 
space group P3lc with lattice parameters a = b = 6.3975(9) A, 
c = 13.351(3) A (estimated standard deviations in parentheses), 
a=B=90° and y=120° (details of the refinement and structure can be 
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Fig. 1| Structural properties of LilnP,Se,.a, b, Crystal structure of LilInP,Se,: a 
2x 2unit cell looking down the c axis (a) anda single unit cell looking down the 
a-bplane, showing layered structure. c, Source side of the reaction vessel 
containing plate-like crystals of CVT-grown LilnP,Se,.d, Raman spectrum of 
LilnP,Se, acquired at room temperature, showing characteristic vibrational 
modes associated with In-Se bonds and P,Se, units. a.u., arbitrary units. 


found in Extended Data Tables 1, 2). LilnP,Se, comprises polyhedral 
layers separated by van der Waals gaps. Scanning electron microscopy 
images show aclear layered nature for this compound (Extended Data 
Fig. 2c—d). Each individual layer has the structure of Cdl,, where Li, In 
and P, dimers each occupy one-third of the octahedral Cd sites in an 
ordered manner and Se occupies the site, as illustrated in Fig. la. The 
stacking of these layers follows an ABAB sequence such that the InSe, 
polyhedra align whereas the LiSe, and P,Se, polyhedra alternate along 
the c axis (Fig. 1b). This structure is typical among metal selenophos- 
phates with the general formula M*M”"P,Se, (M, metal; refs. °°) and 
derives from a larger layered family based on divalent metal cations 
M**,P,Se, (ref.7°). Raman spectroscopy at an excitation of 633 nm shows 
several well-resolved peaks consistent with the vibrational modes of 
the InSe, octahedra and the [P,Se,]* unit seen in Fig. 1d (see Extended 
Data Fig. 3 for details). 

The melting behaviour of LilInP,Se, was determined through dif- 
ferential thermal analysis (DTA), giving melting and crystallization 
points of 717 °C and 695 °C, respectively (Extended Data Fig. 2a). The 
compound melts congruently, as a single thermal event is observed 
upon heating or cooling, and the powder diffraction pattern after DTA 
shows only LilnP,Se, present (Extended Data Fig. 1g). 

We then developed a more effective approach to grow detector- 
grade, large-size LilnP,Se, single crystals using the chemical vapour 
transport (CVT) method. The temperature of the hot zone (660 °C) 
was below the melting point because otherwise the atmosphere would 
saturate with volatile components and inhibit the transport of the other 
constituents. The cold zone was set to 560 °C, creating a driving force 
for the transport of material from the hot to the cold zone. The initial 
transport reactions of LilInP,Se, were carried out without a transport- 
ing agent and produced tiny LilnP,Se, crystals with a low yield (about 
1mg out of ~0.7 g of starting material) even after a few days of transport 
owing to decomposition into amorphous P,Se, (Extended Data Fig. 1d). 

This issue was resolved by the use of iodine as a transporting agent, 
as the formation of InI, and Lil enables the transport of each metal 
component to balance the vapour transport”. lodine-assisted CVT 
enabled the growth of large single LilInP,Se, crystals with an area of 
about 1 cm’ and thickness of -0.05-1 mm for the week-long reactions 
in Fig. 1c. To reduce the number of nucleation sites and improve the 
yield of large crystals, we reversed the zones for 24 hto ensure that all 
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Fig. 2|Band structure and optical properties of LiInP,Se, single crystals. 

a, b, Electronic band structure (a) and density of states (number of states per 
unit cell volume per electronvolt) (b) for LilnP,Se, calculated using the 
PBE+vdW functional ona fully relaxed structure. The inset inb shows the 
Brillouin zone. CBM, conduction band minimum; VBM, valence band 
maximum; F, energy; £;, Fermi energy. c, UV-Vis-NIR transmission spectrum of 
LilnP,Se,. The inset shows the corresponding Tauc plot. a, absorbance; £,, 
bandgap energy. d, PLspectrum of LilnP,Se, obtained at 12.5K and 2 mW, with 
three Gaussian fit peaks. The inset shows an image of the PL emission. 


LilnP,Se, was at the source side. The crystals produced by the transport 
reactions without this first step are smaller and often intergrown with 
each other owing to multiple nucleations (Extended Data Fig. le). The 
reversal of the hot and cold sides at the end of the reaction limits the 
deposition of the gas phase species onto the as-grown crystal surfaces, 
which is necessary for high-quality, clean surfaces that permit good 
electrical contact with the electrodes. The X-ray powder diffraction 
pattern of ground CVT-grown LilnP,Se, crystals matches the simulated 
pattern and exhibits identical thermal behaviour to the bulk material 
(Extended Data Figs. lf, h, 2b), confirming the purity of the transported 
LilnP,Se, single crystals. 

Density functional theory calculations of the electronic band struc- 
ture were carried out using the PBE+vdW functional, revealing LilnP,Se, 
to be an indirect-gap semiconductor with a bandgap transition 
from near the K point at the valence band maximum to the point at 
the conduction band minimum for a bandgap of 0.94 eV (Fig. 2a). The 
conduction band is composed of hybridized Ins and Se pstates associ- 
ated with the empty In 5s lone pair, whereas the valence band is made 
up of Se p states, as seen in Fig. 2b. The conduction band has a high 
curvature, corresponding toa loweffective electron mass (m;) of 0.16m, 
and 0.30m, for the in- and out-of-plane directions, respectively (m., 
electron mass). The valence band maximum lies along the out-of-plane 
direction in reciprocal space (K to H), and the band in this direction is 
very flat, so the hole effective mass in the out-of-plane direction will 
be immense. Accordingly, we expect that the electron mobility should 
be vastly superior in LilnP,Se,, especially given that the planar nature 
of the material forces devices to operate in the out-of-plane direction. 

Ultraviolet-visible-near infrared (UV-Vis-NIR) transmission and 
reflection measurements on a LilnP,Se, single crystal were used to 
measure the bandgap, which was determined to be 2.06 eV (Fig. 2c). 
Photoluminescence (PL) measurements on LiInP,Se, at low temperature 
(12.5 K) under an excitation of 405 nm showed red-orange emission from 
the as-grown surface of the single crystal, with two broad emission bands 
observed with peak maxima near 1.73 eV and 2.05 eV (Fig. 2d). Excitation 
intensity and temperature-dependent PL measurements support the 
assignment of the 1.73 eV peak to donor-acceptor pair recombination 
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Fig. 3| Electrical characteristics and pulse-height spectra of LiInP,Se, 
devices irradiated by a-particle and y-ray sources. a, Dark current and 
photocurrent (absolute values) as a function of voltage (from -100 V to100 V) 
for CVT-grown LiInP,Se, (~0.3 mm thickness) with gold electrodes illuminated 
by ambient light (-0.2 mW cm”). b, Pulse-height spectra of a particles from 
414m for a LilnP,Se, device (-120 um thickness) for electron collection at 
various voltages, collected for 60s each. c, Pulse-height spectrum ofa 
LilnP,Se, detector (-0.5 mm thickness) under a bias of 700 V illuminated by 
y-rays from*‘Am and Co, and without a source. The counts for *’Co y-rays and 
for the natural background are negligible, as shown. The spectra were collected 
for 200s each. d, Pulse-height spectrum at 700 V with the same conditions as in 
b. Inset, Dark current measured with the voltage varying from Oto 1kV, to-1kV, 
andto 0 Vunder the same conditions as ina. The contact area onthe crystals in 
a-d was 3 x3 mm’. The shaping time in b—-d was 1 ps. 


and of the 2.05 eV peak toa free-to-bound transition; a detailed discus- 
sion may be found in Methods section ‘Notes on Extended Data Fig. 3’. 
Despite these defect-related peaks, astrong PL signal from an indirect- 
gap material is an indication of good optical quality. 

LilnP,Se, detector devices were fabricated from large CVT-grown 
single crystals using evaporated planar gold contacts. Pristine LilnP,Se, 
devices exhibit very low dark current, of the order of picoamperes 
(Fig. 3a), corresponding to a high electrical resistivity of around 
10” O cm. The dark-current plot shows a non-zero current at O V, 
though the dark current exhibits linearity in this range (Extended Data 
Fig. 4a). When larger voltages are applied a diode-like current-voltage 
curve is observed, whereas the low dark current is maintained at 2nA 
at 1,000 V (Fig. 3d), which is more than sufficient for effective a- and 
neutron-detection measurements. This behaviour might originate from 
blocked/injected current from the electrodes under a large electric 
field of up to tens of kilovolts per centimetre. When exposed to ambient 
light (-0.2 mW cm”), photogenerated charge carriers in the LilnP,Se, 
detector increase the current by three to four orders of magnitude 
(Fig. 3a). This is an exceptional photoresponse that indicates favour- 
able charge transport properties in LilnP,Se,, as the photogenerated 
carriers efficiently traverse the detector thickness. 

As a direct-conversion semiconductor based on lithium, neutrons 
(n) are captured in LilInP,Se, through the following nuclear reaction: 
n+°Li=*He+°H (energy Q=4.78 MeV). Thus, the interaction of a parti- 
cles produced by the decay of "Am (Q=5.486 MeV) mimics the energy 
deposition of a neutron-capture event at the detector surface. 

The pulse-height spectrum of a LilnP,Se, device with a thickness of 
120 um irradiated by 5.486 MeV“Am a particles for electron collection 
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Fig. 4 | Thermal-neutron spectra resolved by °LilnP,Se, devices. a, Binned 
pulse-height spectrum of a °LilnP,Se, device (showninb) exposed toa 
moderated Pu-Be source without shielding (red) and with Cd shielding 

(0.125 inch thick; black) and toa source ~5 times weaker (grey) for 300 V bias, 
3 ps shaping time and 30-min measurements. The counts from channel 16 and 
above correspond to neutron-capture events. The count rate for the 
unshielded measurement was 0.95 s1cm™. b, The7 x 7mm?° LilnP,Se, device 
(90 um thickness) used for the measurements shownina. 


can be seen in Fig. 3b. Here the channel number reflects the charge 
collection efficiency (CCE), which is the ratio of collected and gener- 
ated charge froma single event (for example, neutron capture), and 
under constant bias can be considered proportional to the energy of 
the incident particle. The peak from the interaction of a particles shifts 
to higher channel numbers with increasing voltage (Extended Data 
Fig. 5),as higher CCE can be achieved with higher applied electric field. 
The energy resolution for the full-energy peak at 700 Vis 13.9% for the 
LilnP,Se, detector, confirming that LilnP,Se, can accurately resolve the 
charged particles generated by neutron interaction. To our knowledge, 
this is the highest energy resolution reported for “Am a particles 
for any direct-conversion thermal-neutron semiconductor detector, 
surpassing the 23.3% resolution” of LilnSe, or the non-resolution of 
the full-energy peak in h-BN due to poor CCE. 

These results are reproducible for different growth sizes, and the 
a-particle response of other LilnP,Se, devices with crystals grown under 
identical conditions can be seen in Extended Data Fig. 6a—h. For hole 
collection in atypical sample, a shoulder, instead ofa full-energy peak, 
is usually observed, implying relatively poor CCE of holes. Nevertheless, 
several devices were able to resolve the “Am peak for both holes and 
electrons. This is because the high electric field that this material can 
withstand (at least 60 kV cm") allows the generated carriers to be swept 
out to the contacts before trapping or recombination can occur, result- 
ing in aspectroscopic response even for the relatively low-mobility 
holes. For aconstant applied voltage across various samples, the CCE 
is higher for thinner samples because of the larger external electric 
field. These results conclusively demonstrate that LilnP,Se, devices can 
achieve a high-resolution spectroscopic response to*Am a particles. 

Isotopically enriched lithium (95% °Li) was used to grow LiInP,Se, 
crystals (hereafter labelled °LilnP,Se,) with maximal °Li to maximize 
the thermal-neutron-capture cross-section. The a-particle response of 
the °LilnP,Se, devices is similar to that of their unenriched analogues, 
with full resolution of the “Am peak for electron collection (Extended 
Data Fig. 7a-e). Hole collection of °LilnP,Se, devices also demonstrates 
some level of a-particle response. 

The thermal-neutron response of the 95% °Li-enriched detectors was 
tested at room temperature using a moderated and very weak Pu-Be 
source, which generated roughly 75 neutrons per second per square 
centimetre. The absorption probability for thermalized neutrons ver- 
sus the thickness of °LilnP,Se, is shown in Extended Data Fig. 8. The 
device had an active area of 7 x 7 mm? and a thickness of -90 pm, and 
was subjected to a bias of 300 V. The binned pulse-height spectrum 
of a°LilnP,Se, device under neutron irradiation shows a defined peak 


that is clearly resolved from the background, demonstrating success- 
ful direct detection of neutrons (Fig. 4a). The pulse-height spectrum 
shows a broad full-energy peak corresponding to the neutron-capture 
reaction close to the dark-current noise background edge. When cad- 
mium shielding (0.125-inch-thick; 1 inch =2.54 cm) was placed in front 
of the device to block most of the thermalized neutrons, the count rate 
was reduced by 87%. This reduction stems from the high absorption of 
neutrons belowthe cadmium cutoff energy (0.5 eV; ref. ””). The energy 
resolution of this neutron-capture peak was 69.7%. This response is 
very promising at this early stage. 

An ideal neutron detector should be insensitive to y-rays to ensure 
that false neutron counts are not generated from incident y radiation. 
The ability of the LilnP,Se, device to discriminate against y-rays was 
evaluated ona thicker detector (~-0.5 mm) exposed to a 0.2-mCi*’Co 
y-radiation source (a photon flux multiple orders of magnitude higher 
than the neutron source discussed earlier). Despite the higher flux, the 
pulse-height spectra of the ambient background and under direct y 
irradiation shown in Fig. 3c are essentially identical, demonstrating 
the negligible y sensitivity of this material. 

Inconclusion, LilnP,Se, isthe first, to our knowledge, direct thermal-neu- 
tronsemiconductor detector with intrinsic °Lito demonstrate a resolved 
full-energy neutron peak above the background. Further improvements 
inthe neutron detection efficiency could be achieved by the growth of 
larger single crystals, such as through the Bridgman growth method. Fur- 
thermore, spectroscopic” Am a-particle detection is achieved with 13.9% 
energy resolution. The excellent semiconducting properties and radiation 
response achieved here demonstrate that LilnP,Se, has the potential to 
transform neutron detection technology, which relies almost exclusively 
onscintillation paradigms, and usher inthe semiconductor modality for 
direct-conversion thermal-neutron detection. 
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Methods 


Chemical reagents 

Lithium (99.9% Sigma Aldrich), indium (99.99% American Elements), 
phosphorus (99.999% Alfa Aesar), selenium (99.99% American Ele- 
ments) and iodine (99.999% Sigma Aldrich) were used as received. 
For the material used in the detectors, high-purity indium (99.9999% 
Alfa Aesar) and selenium (99.999% Alfa Aesar) were used. Isotopically 
enriched lithium (95% °Li) was used to synthesize materials for the 
neutron detection measurements. 


Synthesis of Li,Se 

Li,Se was synthesized using a modified literature procedure”. In an 
argon-filled glovebox, lithium metal was removed from mineral oil 
and the oxidized edges were cut off. The lithium metal, selenium anda 
stir bar were added into a500-mL three-neck flask. The reaction vessel 
was attached to a custom apparatus that allowed an ammonia stream 
to condense liquid ammonia into the vessel under flowing argon. In 
a typical reaction, ~300 mL of ammonia was condensed using a cold 
finger containing an acetone-dry-ice mixture. The reaction continued 
under reflux and stirring until the ammonia colour changed from deep 
blueto clear. After the completion of the reaction, the liquid ammonia 
was boiled off and the vessel material was placed under vacuum over- 
night. The material was stored in a N, glovebox. Ina typical reaction, 
15 g of material was synthesized with a Li:Se ratio of 2:1.002. The slight 
excess of selenium was to ensure that the reaction goes to completion. 


Synthesis of Li, ,,In 

In an argon-filled glovebox, In (2.535 g) and Li (0.147 g) chunks were 
loaded into a glassy carbon crucible with a lid. This crucible was then 
placed ina fused quartz silica tube and the tube was flame-sealed under 
a pressure of ~2 x 10°° mbar. The tube was placed ina vertical furnace 
and subjected to the following heating profile: heat to 700 °C over 
10 h, soak for 2 h, and then cool to ambient temperature. The ingot 
was extracted and stored in a nitrogen box. 


Synthesis of P,Se, 

For P,Se,, red phosphorus pieces (1.356 g) and selenium shot (8.644 g) 
were loaded into a 13-mm fused-silica tube in a stoichiometric ratio. 
The tube was flame-sealed under vacuum (-2 x 10? mbar). The reagents 
were physically mixed by shaking the tube. The tubes were placed in 
a tilted tube furnace so that the tops of the tubes were at the centre 
of the furnace. The following temperature profile was used: heat to 
500 °C over 12h, hold the temperature for 72 h and then furnace-cool 
to ambient temperature. The tubes were opened ina nitrogen-filled 
glovebox for storage. 


Synthesis of bulk LilnP,Se, 

Owing to the air-sensitive nature of Li,Se, the reagents were manipu- 
lated in a nitrogen-filled glovebox. For the bulk synthesis of LilnP,Se,, 
Li,Se (0.437 g), In (1.030 g), P,Se, (4.181 g) and Se (0.351 g) were loaded 
into a carbon-coated, 13-mm fused-silica tube at a molar ratio of 
Li:In:P:Se =1.05:1:2.04:6.12. Alternatively, Li, ,;In (2.500 g), P,Se; (9.643 g) 
and Se (1.667 g) could be used as the starting reagents, and were used for 
the synthesis of detector-grade crystals. Synthesis of chalcogenides con- 
taining Litypically require crucibles that can withstand chemical attack 
from Li, suchas glassy carbon or pyrolytic BN“. In the present synthesis, 
the tubes were carbon-coated to prevent glass attack. The tubes were 
flame-sealed under vacuum (-2 x 10° mbar). The tubes and thermocouple 
were placed inthe same manneras in the reaction of P,Se;. The following 
optimized heating profile was used: heat to 750 °C over 10 h, hold the 
temperature for 24 h, cool to 350 °C over 12 h and then furnace-cool to 
ambient temperature. The tubes were opened in air, revealing an ingot 
comprised of deep red plates. The synthesis of bulk °LilnP,Se, was per- 
formed using °Li, ,,In, P,Se, and Seas the starting reagents. 


CVT 

Preformed material from the bulk synthesis was used as the source 
for the CVT growth of LilnP,Se,. In a typical synthesis, approximately 
0.5-5 g of source material was used, accompanied by 16-30 mg of 1,—the 
amount of I, used depends on the inner diameter of the fused-silica 
tube (11-16 mm). The tube was sealed to a length of 17 cm while the 
bottom was submerged in liquid nitrogen to prevent loss of iodine. 
The tube was then placed in a two-zone furnace and subjected to the 
following heating profile: source zone: heat to 560 °C over 6 h, hold 
the temperature for 12 h, heat to 660 °C over 6h, hold for 168 h and 
then cool to ambient temperature over 6 h; sink zone: heat to 660 °C 
over 6h, hold for 12 h, heat to 560 °C over 6h, hold for 170 hand then 
cool to ambient temperature over 10 h. After the transport process, 
deep-red plates were found on the sink side, and the source material 
was consumed. The vacuum of the tube was carefully breached by 
slowly cutting into the tube at the source side with a diamond saw to 
make a pinhole opening. This step was necessary to prevent damage 
to the thinner crystals from the influx of air. 


X-ray diffraction 

A Rigaku Miniflex600 X-ray diffraction system equipped with a Dtex 
silicon 1-D detector was used to obtain powder X-ray diffraction pat- 
terns. Cu Ka radiation (wavelength A = 1.5406 A) was produced using 
a voltage of 40 kV and a current of 15 mA and filtered with a graphite 
monochromator anda Kf foil filter. A zero-background silicon holder 
was used. Crystals suitable for single-crystal diffraction were obtained 
using CVT reactions. Single-crystal X-ray diffraction was performed 
using a STOE IPDS II diffractometer operating at 50 kV and 40 mA with 
Mo Ka radiation (A= 0.71073 A) with a graphite monochromatizer. Data 
reduction, integration and absorption correction were performed 
using the X-area software package. The program XPREP was used to 
determine the space group and prepare the data for the structure solu- 
tion. Structure solutions using intrinsic phasing and refinements were 
obtained using SHELXT™ and SHELXL”, respectively. OLEX2 was used 
as a GUI’’. VESTA was used to produce crystal structure figures’. 


Optical property characterization 

UV-Vis-NIR transmission and reflection were measured ona PerkinElmer 
LAMBDA 1050 UV/Vis spectrophotometer in the range 1,500-250 nm 
onasingle crystal of LilnP,Se, at room temperature. These were used 
to obtain the absorption and thereby determine the bandgap using a 
linear fit of the Tauc plot. Raman spectroscopy was conducted ona 
single crystal of LilInP,Se, at room temperature using a confocal Horiba 
LabRAM HR Evolution spectrometer with an excitation wavelength of 
633 nm anda laser power of 25 mW. 

Room-temperature PL was measured in the range 475-1,000 nm 
with the same spectrometer using an above-bandgap excitation wave- 
length of 473 nm. The laser power (25 mW) was varied in the range 
0.25-12.5 mW using neutral density filters. Laser powers above 1.25 mW 
led to visible damage to the sample, and below this threshold no PL 
was observed. Above this point, a broad band appears from 1.4 eV to 
2 eV, probably due to an impurity phase formed upon decomposition 
of the sample. 

The temperature and excitation intensity dependence of the PL were 
measured on a LilnP,Se, single crystal in a closed-cycle He cryostat 
(SHI Cryogenics DE-202). The single crystal was mounted on the stage 
using Apiezon N grease. The excitation source was a continuous-wave 
diode laser (OBIS 405 nm LX; beam diameter 0.8 mm) with the power 
digitally adjusted in the range of 0.5-21 mW. A 405-nm bandpass filter 
(full-width at half-maximum 10 nm; Thorlabs) screened the laser light 
focused onto the sample, and the emitted light was filtered by a450-nm 
long-pass filter (Thorlabs) before reaching the monochromator 
entrance slit (width 200 pm). A chopper was used at 710 Hz toincrease 
the signal-to-noise ratio. PL spectra were resolved by a 500M SPEX 


grating monochromator with a Hamamatsu R928 photomultiplier 
tube biased at 500 V, which was read into a SR810 lock-in amplifier 
(Stanford Research Systems). PL spectra were collected over the range 
of 530-900 nmatarate of1nms7. 


DTA 

For the determination of the thermal properties of LilnP,Se,, a Netzch 
Simultaneous Thermal Analysis (STA 449F3) instrument was employed. 
Approximately 25 mg of the sample was sealed ina fused-silica ampoule. 
Analumina ampoule of similar mass was used as a reference. The heat- 
ing and cooling rates were 10 K min”. 


Band structure 

The band structure calculations were performed using VASP (Vienna ab 
initio simulation package) within the framework of density functional 
theory. The symmetry-unconstrained lattice parameters and atom 
coordinates were relaxed to the local total energy minimum, starting 
from the structure that we measured experimentally. The structure 
optimization was performed by applying the Perdew-Burke-Ernzerhof 
exchange and correlation functional with the van der Waals extension 
(PBE+vdW) using the optB88-vdW functional’’. The electronic-band 
dispersions were calculated for the fully relaxed structures using the 
special k points and the high-symmetry path in the Brillouin zone 
defined by Setyawan and Curtarolo”. The analysis of the data generated 
by the ab initio calculations was carried out with Python scripts with 
extensive use of the Atomic Simulation Environment (ASE) and the 
Python Materials Genomics (pymatgen) materials analysis libraries*. 
The effective masses were calculated using the set of python scripts 
available at the emc repository®. 


Scanning electron microscopy 

A Hitachi 3400 scanning electron microscope was used to image the 
microstructure. The accelerating voltage and probe current were set to 
20 keV and 70 mA, respectively. Energy-dispersive X-ray spectroscopy 
(EDS) was performed using a PGT energy-dispersive X-ray analyser. 
Aztec software from Oxford Instruments was used to analyse the EDS 
data. 


Device fabrication 

Planar-type devices were fabricated using single crystals grown by CVT 
without cutting or polishing, owing to the high surface quality of the 
as-grown crystals. Translucent and flat crystals that were visibly defect- 
free were selected. Gold electrodes with a typical area of 3 x 3 mm? 
(unless otherwise stated) and thickness of ~70 nm were deposited on 
both sides of the crystal by thermal evaporation at arate of -1As ‘using 
an SQC-310 Inficon deposition controller. Cu wires were attached to 
the gold electrodes using graphite paste, allowing the crystals to be 
connected to the external circuit used for property measurements. 


Electrical properties and a-particle response 

The electrical resistivity was measured using a Keithley 6517B elec- 
trometer. For dark-current measurements, the device was enclosed in 
alight-tight metal box. The photoconductivity was measured using 
ambient light (0.2 mW cm? 400-800 nm) as the source of excitation. 
Current-voltage characteristic curves from 100 V to—-100 V were used 
to calculate the electrical resistivity. To enable collection of the 
a-particle spectrum, planar-type detectors with evaporated gold con- 
tacts were used for pulse-height spectroscopy. The a-particle source 
employed was “Am, with a typical a-particle kinetic energy of 
5.486 MeV. The detectors were enclosed ina light-tight metal box with 
the source ~3 mm away from the top electrode. The device was con- 
nected to an eV-550 preamplifier and a voltage source that applied a 
voltage of +20-1,000 V to the bottom electrode. An ORTEC amplifier 
(model 572A) was used to amplify and shape the signal from the pre- 
amplifier. The gain and shaping time were set to 100-200 and 0.5-2 us, 


respectively. A dual 16K-input multichannel analyser (model ASPEC- 
927) and MAESTRO-32 software were used to process the amplified 
signal and display the response spectrum, respectively. The Hecht 
equation for a single carrier type is acommon way to estimate the 
2: 

wea - exp(av) | 5 
where dis the thickness of the semiconducting region through which 
the charge carrier travels and Vis the applied voltage. The charge col- 
lection efficiency (CCE) refers to the fraction of charge that is collected 
versus the generated charge. 


mobility-lifetime product pt (ref. **) as CCE = 


Neutron detection 

The same setup used to measure the pulse-height spectrum of a 
particles was used to determine the neutron response. The source 
of neutrons used was a polyethylene-moderated PuBe source. A *He 
proportional tube was used to determine the neutron flux of the source 
so that the absolute efficiency of the devices could be calculated. The 
°LilnP,Se, device was placed at the same distance away from the source 
as the *He detector. Collection times were 30 min. The number of chan- 
nels was reduced from 16,384 to 128 by summing the counts from 128 
channels into a single channel. 


Notes on Extended Data Fig. 1 

Note for bulk synthesis and chemical stability. The optimized synthe- 
sis of LilInP,Se, involves a relatively small excess of Li/P/Se to synthesize 
phase-pure material. Whena stoichiometric ratio of reagents was used, 
secondary phases were present (see Extended Data Fig. 1c). The need 
for additional P and Se may stem from the imbalanced vapour pressure 
of different vapour components, which probably induces a P/Se-rich 
atmosphere, leaving the melt deficient in P/Se. For a few reactions, 
even when the carbon coat peeled off the silica wall, there were no 
signs of glass attack, suggesting that the as-formed LilnP,Se, material 
does not react with silica. Bulk LilnP,Se, shows good stability under 
ambient atmospheric conditions. However, when the ingot is finely 
ground, the resulting powder turns from bright orange to dark brown 
after several minutes of air exposure. Both the ingot and powder were 
very sensitive to liquid water and appeared to exfoliate after several 
seconds in water, potentially from water intercalating into the layers. 


Gas phase equilibrium in the CVT reaction. Because this vapour 
transport experiment involves a quinary system when considering the 
iodine, the gas-solid phase equilibrium is quite complex. For a given 
material to transport, every element present in the material must have 
some species that favours the gas phase. Typically, metal selenides can 
be transported using iodine through the formation of the gas phase spe- 
cies, M"'I,,,and Se, dimers”. Phosphorus is relatively volatile and trans- 
ports well through the formation of either P, tetramers or PI, (ref. *°). 
By contrast, the transport of alkali metals is more difficult® as alkali 
metal compounds typically have low vapour pressures at moderate 
temperatures and thus transport poorly. Liis probably transported 
as Lil, given its relatively low melting point of 469 °C and boiling point 
of 1,171°C. For crystals grown without employing the reversal of zones 
at the end of the growth, EDS mapping (Extended Data Fig. 2e) of the 
surface demonstrated a growth front rich in iodine, demonstrating the 
involvement of iodine in the transport of LilnP,Se,. 


Notes on Extended Data Fig. 3 

Room-temperature PL measurements on a LilnP,Se, single crystal 
showed no response to excitation at 473 nm except upon decomposi- 
tion at high laser powers, when a broad band centred at 1.7 eV appears 
(Extended Data Fig. 3). This lack of PL response may be due to the indi- 
rect gap of the compound, which reduces the probability of radiative 
recombination due to the momentum difference between the band 
edge states, requiring phonons to mediate the transition. Thus, low- 
temperature PL was used to determine the optical quality of these 
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crystals and investigate whether any defects were present. As observed 
in Fig. 2d, LilnP,Se, exhibits two broad PL peaks at low temperature 
(12.5 K), which can be modelled by three Gaussian peaks. A two-peak fit 
was attempted, but it was found to require the use of an exponentially 
modified Gaussian peak as described in ref. ” to adequately model the 
asymmetry of the higher energy band. However, the use of sucha peak 
is not justified here because the source of emission is not the result ofa 
distribution of phonon-related peaks. Hence, these emission bands are 
best fitted using three Gaussian peaks, where peaks 2 and 3 correspond 
tothe same broad band so they are expected to show similar behaviour. 
To further characterize the PL emission at 12.5 K, the PL dependence 
on the excitation intensity was tested by increasing the laser power 
from 0.5 mW to 21 mW, resulting in corresponding enhancements in 
the emission intensity and a slight blueshift in the peak maxima for all 
three peaks (Extended Data Fig. 3a). 

The PL intensity /has a power-law dependence on the laser power Lin 
the form/« L* with the behaviour governed by the exponent k (ref. °°). 
Excitonic emission shows superlinear behaviour with increasing exci- 
tation intensity, corresponding to a coefficient of 1< k <2, whereas 
free-to-bound and donor-acceptor pair (DAP) recombination pro- 
cesses have a power-law coefficient k below 1. This coefficient can be 
derived from the slope of a plot of log/ versus logl for each peak, as 
shown in Extended Data Fig. 3b. Each of the three peaks observed in 
LilnP,Se, has a power-law coefficient below 1 (Extended Data Fig. 3b), 
so these emissions are a result of either free-to-bound transitions or 
DAP recombination. We note the essentially identical behaviour of 
peaks 2 and 3; these peaks correspond to the same band and thus 
have the same power-law coefficient of 0.91+ 0.04. The temperature 
dependence of PL at 2mW demonstrates that the higher-energy emis- 
sion quenches first, with peaks 2 and 3 merging at 70 K and vanishing 
by 90K, and peak 1 persisting until about 150 K (Extended Data Fig. 3c). 
The energies of peaks 2 and 3 blueshift slightly as the temperature 
rises, whereas peak 1 redshifts dramatically between 12.5 K and150K 
(Extended Data Fig. 3d). This redshift causes the low-energy tail of 
peak 1 to fall past the response limit of the photomultiplier tube 
(1.46 eV), producing an artificial asymmetry; thus, above 55 K the 
fit ignores data below 1.48 eV to accurately reproduce the Gaussian 
peak shape. 

PL quenching occurs when a nonradiative recombination process 
competes for the photogenerated carriers involved in the emission. The 
temperature dependence of this quenching behaviour can be modelled 
using an Arrhenius plot, following the frequently used expression”: 


E -1 
I(T) hia exo(-{3)| (1) 


where /, is the PL intensity at O K, ais aconstant, kis the Boltzmann 
constant and F, is the activation energy corresponding to the compet- 
ing nonradiative recombination. This model corresponds to the sim- 
ple case of a single nonradiative transition, and adequately describes 
the Arrhenius plots of the integrated PL intensity versus the inverse 
temperature, as indicated by the solid lines in Extended Data Fig. 3e. 
The activation energies of the nonradiative quenching processes are 
determined to be 40.0 + 6.6 meV, 15.2 + 6.2 meV and 16.8 + 3.4 meV for 
peaks 1, 2 and 3, respectively. Again, the behaviours of peaks 2 and 3 
are quite similar, indicating that they should have the same emission 
mechanism. 

The power-law coefficients of all three peaks are consistent with 
either DAP recombination or free-to-bound transitions, whereas the 
energy of each peak blueshifts with increasing excitation intensity, as 
expected for DAP recombination*°. However, the temperature-depend- 
ent behaviour of peak lis markedly different from that of peaks 2 and 3, 
with different energy shifts as the temperature increases and distinct 
quenching processes for the two bands. To assign these accurately, we 
compare the relatively ‘deep’ energy of peak 1 (coming from energy 


levels in the middle of the bandgap) to the ‘shallower’ peaks 2 and 3 that 
emit from energy levels closer to the band edge. DAP recombination 
occurs between a donor and an acceptor band, both of which must lie 
deep enough within the bandgap to avoid being ionized, whereas a free- 
to-bound transition is between a single defect band and the opposite 
band edge, so itis only as deep as the defect band. At 1.73 eV, peak 1lies 
well below the band edge (2.06 eV at room temperature) while peaks 2 
and3 are much closer to the bandgap at 1.98 eV and 2.06 eV, respectively. 
Together with the difference in temperature-dependent behaviour, 
this supports the assignment of peak 1 to DAP recombination and of 
peaks 2 and 3 toa free-to-bound transition. The PL measurements are 
summarized in Extended Data Fig. 3f, along with these tentative peak 
assignments. These results reveal the presence of at least two defect lev- 
els, which should be identified and removed. Nevertheless, the presence 
of astrong PL signal from an indirect-gap material is an indication of 
good optical quality and the removal of these defects promises further 
improvements in the detector performance of LiInP,Se,. 
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Extended Data Fig. 1| X-ray powder diffraction patterns of LilnP,Se, 
prepared under different conditions. a, b, Experimental versus simulated 
powder diffraction pattern of LilnP,Se, scaled to the largest peak (a) and scaled 
by a factor of 20 (b). The simulated pattern has a March-Dollase parameter of 
0.3. The X-ray powder diffraction pattern of LilnP,Se,inashows a considerable 
preference to orientation along <0 O J), that is, the peaks with the largest 
intensity correspond to the (0 02), (00 4) and (008) planes. This preferred 
orientation stems from the layered nature of this compound, which causes the 
layers to lie parallel to the sample holder. The simulated pattern with a March— 
Dollase parameter of 0.3 was the reference pattern used to compare with the 
experimental pattern, and accurately accounts for the sample’s preferred 
orientation. When the patterns are zoomed-in closer to the baseline, the 
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reflections with contributions fromthe hand k directions can be seen and 
match well with the simulated pattern (b). c, Experimental versus simulated 
powder diffraction pattern of LilInP,Se, obtained using a stoichiometric 
amount of reagents. The unknown secondary phases are marked by asterisks. 
d, Sink side of the reaction tube used for CVT with no iodine charged into the 
tube. e, Sink side of the reaction tube used for CVT with iodine as the 
transporting agent. This reaction did not employ an initial reverse transport 
step. f, Experimental versus simulated powder diffraction pattern of LilnP,Se, 
grown using CVT. The simulated pattern has a March-Dollase coefficient of 
0.3. g, Experimental powder diffraction pattern of bulk LilInP,Se, before (red) 
and after (black) DTA up to 760 °C. h, Experimental diffraction pattern of 
LilnP,Se, grown by CVT before (red) and after (black) DTA up to 760 °C. 
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Extended Data Fig. 2| Thermal analysis, morphology and compositional 
analysis for LiInP,Se,.a, b, DTA measurement of bulk-synthesized (a) and CVT- 
grown (b) LilnP,Se, up to 760 °C. The vertical axis is a measure of the voltage 
difference of the thermocouples onthe sample and the alumina standard as the 
temperature is varied; thus, the spikes correspond toa transfer of heat fromthe 
sample to the environment and vice versa. The melting point of LilnP,Se, is 
considerably lower than that of LilnSe, (-915 °C), which yields several practical 
benefits, including the reduction of thermally induced defects and improved 
stoichiometry from reduced evaporative losses of the volatile Li, Pand Se. 

c,d, Scanning electron microscopy images of bulk LilnP,Se, in the basal plane 


250um 


(c) and across-section of the layers (d). The lamellar crystal morphology of the 
bulk ingot can be clearly seen in the images. A large flake with asmooth surface 
was selected to examine the surface of the basal plane. c shows a typical image 
of the smooth basal plane. The composition obtained from EDS from both 
orientations yielded the semiquantitative formula In, ,P3oS€¢,o, whichis 
consistent with the expected ratio of 1:2:6 determined from the crystal 
structure. The lithium content could not be determined from EDS because of 
theinsensitivity of EDS to lithium. e, EDS mapping of the surface of LilnP,Se, 
grown using CVT without reverse transport. The growth front of the layer 
appears richiniodine. 
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Extended Data Fig. 3| PL characterization of LiInP,Se,.a, Dependence of the temperature. e, Arrhenius plots of integrated PLintensity versus inverse 
PL spectra of LilnP,Se, at 12.5 K on laser power. b, Power-law fits of the temperature. The solid lines are the least-squares best fit for each peak 
integrated PL intensity versus power for each peak. c, Temperature according to Eq. (1). f, Tabulated PL peak properties for LilnP,Se, at 12.5K. 


dependence of the PL spectra of LilnP,Se, at 2 mW. d, PL peak energy versus See Methods for further discussion. 
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Extended Data Fig. 4 | Electrical properties and spectroscopic response of 
LilnP,Se, device. a, Current-voltage plot (from -100 V to100 V) ofa pristine 
LilnP,Se, device showing linear behaviour. b, Pulse-height spectra of a 
particles from an™ Am source at 700 V witha Gaussian fit (blue curve). 

c, Binned pulse-height spectrum of a7 x 7 mm?°LilnP,Se, device (90 pm 


o>) 
= 


‘ X-xC)/w)*2) 
——nshielded at 300V 


Gauss Fit B 


4 


Adj.R-Sq 0.72822 


sigma 
FWHM 
Height 


DDnowdwWDwD 
> 


Value 
0 
26.00778 
15.39988 
794.052 
7.69994 
18.13197 
41.148 


Equation y=y0 + (A/(w*sqrt(PI/2)))*exp(-2*(( 


Standard 
0 
0.46723 
1.24503 
49.392 


16 32 48 64 
Channel Number 


thickness) exposed toa moderated Pu-Be source while unshielded under 300 V 
bias for 30 min, and the corresponding Gaussian fit (blue curve). The tables in 
b, cshowthe Gaussian equation used to fit the full-energy peak and values of 
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Extended Data Fig. 5 | Channel number versus voltage for LiInP,Se, devices 
collecting electrons. The figure of merit for charge transport in 
semiconductor radiation detectors is the mobility-lifetime product, yt. For 
reliable neutron detection, zt must be sufficient so that counts from neutron 
capture are at a sufficiently high channel number to be distinct from the 
background to allow accurate discrimination. Here the channel number is 
plotted versus voltage for a typical device, and surprisingly the curve exhibits 
two slopes in different voltage ranges. The Hecht equation cannot model these 
data because the model assumes linear behaviour at low voltages, which then 
saturate towards amaximum channel number as the CCE reaches unity. The 
observed behaviour has the opposite concavity to the model, with aslight rise 
in slope as voltage increases. This change in slope can be explained by the 
reduction of the effective voltage from an internal polarization antiparallel to 
the applied field. Similarly atypical behaviour has been observed inthe 
a-particle response of cadmium zinc telluride at T=200K, where this effect 
was attributed to polarization from deep-level traps“. 
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Extended Data Fig. 6 | Pulse-height spectra of four other samples. a-h, Pulse- 
height spectra are shown for electron (a—d) and hole (e-h) collection. The 
growth conditions of these samples were the same as those used for the 


samples that gave the spectra shown in the main-text figures. For hole 


collection, the spectra had lower channel numbers than the electron-collection 
spectra, indicating lower CCE for hole transport. 
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Extended Data Fig. 7 | Pulse-height spectrum for °Li-enriched LilnP,Se, 
devices. a—e, Data are shown for °Li-enriched LilnP,Se, devices with areas of 
7x7mm? (a,b), 55mm? (c) and3 x3 mm? (d, e) . Devices with larger electrode 
areas had lower energy resolutions for electron collection, possibly owing to 
property variation among different samples. f, As-made LilnP,Se, detectors 
with a diameter of 2mm (‘1’) and5 mm (‘2’) onthe same single crystal. g, Time- 
dependent a-particle spectra collection by LilnP,Se, detector 2 inf. We note 
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that the peak channel number and count rate per second were constant over 
time. h, Collection of a-particle spectra by detector 1linf witha pulse generator. 
i, Collection of a-particle spectra by detector 2 inf with a pulse generator. We 
note that the positions of the pulser peak inh andiare the same and the 
corresponding energy resolution is (3.5 + 0.1)%, indicating the same 
electronics noise during collection. 
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Extended Data Fig. 8 | Absorption probability versus °LilnP,Se, thickness 
for thermalized neutrons. In addition to charge transport properties, the 
neutron-capture cross-section of LiInP,Se, must be considered, as this directly 
correlates with the intrinsic maximum of the detector efficiency. Alinear 
attenuation coefficient can be used to quantify the percentage of incident 
neutrons absorbed ina given material through the equation: 

fraction of neutrons captured (%) = (1- e™") x 10%, where /is the thickness of the 
active region and ais the linear attenuation coefficient, which is calculated 
using the capture cross-section of thermalized neutrons for each element and 
its molar density. The calculated mass attenuation coefficient for thermalized 
neutronsis5.1cm7and 1.4 cm for LilnP,Se, fully enriched in °Li and natural- 
abundance Li, respectively. A detector witha thickness of about 9 mm would be 
able to absorb 99% of the incident neutrons. Here we achieved the successful 
growth of crystals with thickness of -1mm, which would absorb -40% of 
incident neutrons. Increases in efficiency would be achievable through scaled- 
up growth of thicker crystals or stacking of several thinner detectors”. When 
451m (the most common isotope in natural indium) absorbs a neutron, the 
nuclide produces a y-ray instead of a highly energetic charged particle. Thus, 
about 20% of the neutrons absorbed do not contribute to the signal, which sets 
the maximum theoretical detector efficiency of °LilnP,Se, to approximately 
80%. 
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Extended Data Table 1| Crystal data and structure refinement for LilnP,Se, at 293 K 
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0.4 x 0.2 x 0.05 mm* 


6 range for data collection 
Index ranges 


Reflections collected 


Independent reflections 


Completeness to 6 = 
25.242° 


Refinement method 


Data / restraints / 
parameters 


Goodness-of-fit 

Final R indices [I > 20(I)] 
R indices [all data] 
Extinction coefficient 


Largest diff. peak and hole 
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Extended Data Table 2| Atomic coordinates (x10*), equivalent isotropic and anisotropic displacement parameters («10° A2), 
selected bond lengths (A) and angles (°) for LilnP,Se, at 293(2) K, with estimated standard deviations in parentheses 


Label x y 
In(01} 10000 10000 


Zz Occupancy 


7600 1 


Se(01) 6613(1) BB77(1) B204(1) 1 


P(01) 3333 6667 
Li(01) 6667 3333 


BB72(2) 1 
7500 1 


Ueq U4 
( 


16 
15 
11 
37 


“Ueq is defined as one third of the trace of the orthogonalized Uj tensor. 


The anisotropic displacement factor exponent takes the form: -2m7[h2a"2U41 +... + 2hka"b'Urz]. 
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P(01)-Se(01)}-In(01) 
P(01)-Se(01)-Li(01) 
$e(01)'-P(01)-Se(01) 
$Se(01)'-P(01)-P(01)' 
$e(01)-Li(01)-Se (01) 
$e(01)-Li(01)-Se (01) 
$e(01)-Li(01)-Se (01) 
$e(01)'-Li(01}-Se(01) 


P(O1)-S 
P(o1}-P 


Distances (A) 
2.7576(4) 
2.1859(6) 
2.212(3) 
2.7850(4) 


1) 12(1) 
(1) 10(1) 
1) 8(1) 
(5) 13(4) 


B(1) 0 0 
71) 14) 21) 
41) 0 0 
72) 0 0 


Angles (°) 


178.527( 
94 232( 
84 .780( 
96.227( 
94 .233( 
83.944( 
( 
( 
( 
( 


13) 
6 
3 
6 
6 
2 
101.88(4) 
) 
) 
) 
1 
6 
5 
3 


3 
16) 
13) 
16) 
16) 
12) 


101.65(4 
112.17(4 
106.61(5 
178. 760( 
94 .205(1 
95.887(1 
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Transparent piezoelectrics are highly desirable for numerous hybrid ultrasound- 
optical devices ranging from photoacoustic imaging transducers to transparent 
actuators for haptic applications! ’. However, it is challenging to achieve high 
piezoelectricity and perfect transparency simultaneously because most high- 
performance piezoelectrics are ferroelectrics that contain high-density light- 
scattering domain walls. Here, through a combination of phase-field simulations and 
experiments, we demonstrate a relatively simple method of using an alternating- 
current electric field to engineer the domain structures of originally opaque 
rhombohedral Pb(Mg,;Nb,/;)03-PbTiO; (PMN-PT) crystals to simultaneously 
generate near-perfect transparency, an ultrahigh piezoelectric coefficient d;, (greater 
than 2,100 picocoulombs per newton), an excellent electromechanical coupling 
factor k;; (about 94 per cent) and a large electro-optical coefficient y;; (approximately 
220 picometres per volt), which is far beyond the performance of the commonly used 
transparent ferroelectric crystal LiNbO,. We find that increasing the domain size leads 
toa higher d,, value for the [001]-oriented rhombohedral PMN-PT crystals, 
challenging the conventional wisdom that decreasing the domain size always results 
in higher piezoelectricity*® °. This work presents a paradigm for achieving high 
transparency and piezoelectricity by ferroelectric domain engineering, and we expect 
the transparent ferroelectric crystals reported here to provide a route to a wide range 
of hybrid device applications, such as medical imaging, self-energy-harvesting touch 


screens and invisible robotic devices. 


Achieving simultaneous high piezoelectricity and perfect transparency 
in a piezoelectric material has long been a challenge. For example, 
traditional high-performance piezoelectric transducers are typically 
made from perovskite ferroelectric ceramics and crystals with chemical 
compositions that are close to their morphotropic phase boundaries 
(MPBs), such as Pb(Zr,Ti)O, (PZT) ceramics and domain-engineered 
PMN-PT crystals. These materials possess very high d,, and k;, values" “, 
but they are usually opaque in the visible-light spectrum. On the other 
hand, the commonly used transparent piezoelectric LINDO; crystals and 
polyvinylidine fluoride (PVDF) polymers®’ have good transparency but 
much lower d;; and k,; values (LINDO,: d;;< 40 pC N“4, k33 = 47%; PVDF: 
d;;~ 20 pC N“, k;3 ~ 16%) that severely limit the acoustic source level, 
bandwidth and sensitivity of the transducers. 

In addition to the extrinsic effects, such as porosity and grain 
boundaries, which are ubiquitous in ceramics, the poor transparency 
in PZT ceramics and domain-engineered PMN-PT crystals is closely 
associated with light scattering and reflection from their ferroelec- 
tric domain walls. There are two possible approaches to reducing the 


light-scattering domain walls. The first is to pole a ferroelectric crystal 
along the polar direction to achieve a single-domain state. However, 
the d,, value of such single-domain PMN-PT crystals is generally very 
low?*—much lower than that of [001]-poled multidomain rhombohe- 
dral PMN-PT crystals (>1,500 pCN’). In principle, one could first pole 
arhombohedral PMN-PT crystal along the [111] direction to achieve a 
single-domain state with good transparency, then rotate the crystal 
to the [001] direction to guarantee high longitudinal piezoelectric- 
ity. However, this approach is not feasible in practice (see Methods 
for a detailed explanation). The second approach is to dramatically 
reduce the domain sizes by breaking the domains into polar nanore- 
gions with spatial sizes (a few to tens of nanometres) much smaller 
than the wavelength of visible light, thus greatly improving their light 
transparency—as observed in La-doped PZT®”*. However, improving the 
transparency using polar nanoregions is achieved at the expense of a 
markedly reduced remanent polarization and thus very low d,, values; 
therefore, despite more than 50 years of effort, optical functionalities 
in high-performance piezoelectrics have not been realized. 
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Fig. 1| Phase-field simulations of the domain structures and evolution of 
[001]-oriented rhombohedral PMN-28PT single crystals via d.c. anda.c. 
poling. a, Domain structures for unpoled, d.c.- and a.c.-poled samples. The 
subscript c denotes pseudo-cubic coordinates. The black and white vectors on 
the right show the polarization directions, and the colour denotes the positive 
(white) and negative (black) polarization components along the [100] 
direction. The other colours represent different ferroelectric domains (the 
corresponding polar directions are illustrated on the right). Examples of 71° 
and 109° domain walls are indicated by the red arrows. A beam of light is 


Inthis work, we show that a.c. electric fields can be used to effectively 
eliminate light-scattering 71° domain walls for [001]-oriented rhom- 
bohedral PMN-PT crystals to achieve both near-perfect transparency 
and ultrahigh piezoelectricity. We first perform phase-field simulations 
to study the domain evolution of a [001]-oriented 0.72Pb(Mg,,;Nb,,3) 
O;-0.28PbTiO; (PMN-28PT) rhombohedral crystal using conventional 
d.c. and a.c. electric fields. We generate the initial pristine unpoled 
state starting from a random distribution of small polarizations that 
represent a high-temperature paraelectric state. The obtained multi- 
domain state contains all eight possible (111) rhombohedral domain 
variants with an average size of -20 nm. Three types of domain wall 
are present in the unpoled rhombohedral crystal: 71°, 109° and 180° 
domain walls. 

Under ad.c. electric field along the [001] direction, the four domain 
variants with polarizations along [111], [111], [111] or [111] are switched 
tothe [111], [111], [111Jor [111] directions. Thus, only 71° and 109° domain 
walls survive, whereas the 180° domain walls are eliminated, as shown 
Fig. la. The horizontal layers are separated by a set of 109° domain walls 
parallel to the (001) plane, whereas within each lamina there are 71° 
domain walls approximately parallel to {011} planes. It should be noted 
that 71° domain walls can scatter light because the refractive indices 
n,and n, (where o and e represent ordinary and extraordinary light, 
respectively) change as light travels across a 71° domain wall, as shown 
in Extended Data Fig. 1. By contrast, 109° domain walls do not induce 
light scattering as the refractive indices are the same for the domains 
on both sides of the wall. 
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schematically illustrated in the d.c.- and a.c.-poled cases, indicating that the 
light may not be scattered as it travels through the a.c.-poled sample. 

b, Snapshots of the domain pattern evolution during the reversal of polarization 
under a.c. poling. The initial state (that is, electric field F=0.0kVcm”) is the 
d.c.-poled sample. The blue and red arrows represent the applied electric fields 
along the [001] and [001] directions, respectively. The dimensions of all of the 
domain structure plots are 512 nm x 512 nm. The domain evolution under ana.c. 
electric field is shownin Supplementary Videos 1, 2. For comparison, the 
domain evolution under ad.c. electric field is shownin Supplementary Video 3. 


Our phase-field simulations demonstrate that the application ofan 
a.c. electric field effectively reduces the number of 71° domain walls, 
with only two 71° domain walls left in each lamina after a.c. poling, 
leading toa much larger domain size within each lamina. To understand 
the reason for the elimination of 71° domain walls by a.c. poling, we 
analyse the domain evolution during the polarization reversal process, 
as shown in Fig. lb and Supplementary Videos 1, 2. One cansee that the 
reversal of the electric field causes ‘swinging’ of 71° domain walls: that 
is, 71° domain walls alternate between the (011) and (011) planes. Dur- 
ing this process, the contiguous 71° domains tend to merge with each 
other, resulting ina considerable increase in 71° domain size after a.c. 
poling. In addition (Extended Data Fig. 2), the total free energy of the 
system is reduced during this poling process as the energies arising 
from the discontinuities of polarization/strain associated with domain 
walls decrease when the domain wall density decreases. In other words, 
alternating the polarity of the electric field lowers the free energy ofa 
ferroelectric crystal, leading to a domain structure with reduced 
domain wall density. As discussed above, owing to the substantially 
decreased 71° domain wall density, the light transmission of the a.c. - 
poled sample is expected to be superior to that of a corresponding 
d.c.-poled sample. 

Following the phase-field simulations, we characterized the domain 
structures of a.c.-poled and d.c.-poled PMN-28PT crystals. Using bire- 
fringence imaging microscopy (BIM)”, we characterized the orienta- 
tion (@) of the principal axis of the optical indicatrix projected on the 
(001) planes, as shown in Fig. 2a. For arhombohedral single domain, 
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Fig. 2| Analysis of the domain structures for the [001]-oriented PMN-28PT 
crystals. a, BIM image; the colours indicate the orientation angles (@) of the 
projection of the principal optical axis on the (001) plane. Red and blue 
represent the projection of the principal axis of the optical indicatrix along the 
diagonal faces onthe (001) plane, that is, @ of 45° and 135°, respectively. As an 
example, the projections of the principal axis of the optical indicatrix of the 


the projection of the optical axis is along the diagonal face. There- 
fore, the g value of arhombohedral domain would be 45° or 135°, which 
are represented by blue and red colours, respectively (colour bar in 
Fig. 2a). A multidomain crystal may show two or more colours simul- 
taneously in the projection map owing to the overlap of ferroelectric 
domains and domain walls along the light propagation path. 

The orientation map for the unpoled sample shows an irregular 
colour distribution ona very fine scale. This is because the domain 
size of the as-grown PMN-PT is much smaller than the experimental 
resolution (the wavelength of the light is 590 nm); therefore, the exact 
domain pattern may not be clearly revealed. Compared with classical 
ferroelectrics (for example, BaTiO;), the relatively small domain size in 
relaxor ferroelectrics (of the order of several tens of nanometres before 
poling'®”°) is attributed to the presence of random fields/bonds that 
inhibit the growth of ferroelectric domains” ™. 

After d.c. poling, the domains represented by regions with the same 
colour increase in size, and the boundaries between the domains are 
approximately along the [100] and [010] directions, which are associated 
with the projections of 71° domain walls ((101), (101), (011) or (011) 
planes) on the (001) plane. In this image, the colours of most regions 
are neither red nor blue. Of particular importance is the substantially 
enlarged domain size in the a.c.-poled sample, where the in-plane size 
of the rhombohedral domain is on the millimetre scale. It should be 
noted that the domain size obtained from the phase-field simulation 
is much smaller than that from experiments. This is because the spatial 
scale in the phase-field simulation (512 nm) is much smaller than that 
of the materials in the experiments (at the millimetre scale). By coars- 
ening the scale in the phase-field simulation, the domain size of the 
a.c.-poled crystal is found to increase (Extended Data Fig. 3). It is dif- 
ficult to perform a phase-field simulation on the millimetre scale and 
simultaneously resolve the polarization profiles across domain walls 
of ~1nm in thickness. In this work, we used phase-field simulations to 
qualitatively analyse the domain evolution of PMN-28PT crystals dur- 
ing a.c. poling. We also characterized the cross-section domain struc- 
ture of a.c.-poled and d.c.-poled samples to investigate the domain 
size in the out-of-plane direction. As shown in Extended Data Fig. 4, we 
found that the width between two neighbouring 109° domain walls is 
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[111] and [111] domains are illustrated in the schematics on the right of the figure. 
b, Reciprocal space maps of the {222} reflections from the samples under 
different poling conditions, measured with high-resolution single-crystal X-ray 
diffraction. The colour bars indicate the intensity of the diffraction. The 
thickness of the samples is 175 pm. 


similar for both samples (~1 tim), indicating that most 109° domain 
walls survived after a.c. poling, whichis consistent with the phase-field 
simulations. 

X-ray diffraction patterns confirm the main observations from the 
BIM images. Figure 2b shows the {222} reflections for the [001]-oriented 
PMN-28PT crystals. If the rhombohedral domain variants are evenly 
distributed in the sample, there should be two diffraction peaks in 
the 26-w map (26 is the angle between the transmitted and reflected 
X-ray beams; w represents the angle between the incident beam and 
the sample surface): one peak is associated with the (222) plane ata 
lower 26, and the other peak is associated with the remaining three 
{222} planes at a higher 26. Thus, the integrated intensity of the high-20 
reflection is supposed to be three times that of the low-26 reflection. 
This is approximately what is observed in the unpoled sample. The 
diffused distribution of the diffraction along the w axis is associated 
with the lattice distortions due to the existence of domain walls. The 
diffraction peaks converge into distinctive sharper reflections after d.c. 
poling, indicating that the domains become larger and that the volume 
fraction of domain walls decreases. Eventually, in the a.c.-poled sample, 
only the high-26 diffraction peak is observed, and the diffusiveness of 
the diffraction peak along the w axis is much smaller than that of the 
d.c.-poled and unpoled samples (Extended Data Fig. 5). These features 
reveal that the X-ray beam is approximately incident onasingle domain 
of the a.c.-poled sample. The size of the beam here is about 1 mm’, 
which indicates that the in-plane domain size of the a.c.-poled sample 
is equal to or larger than 1 mm’. 

Owing to the unique domain structure, a.c.-poled PMN-28PT crys- 
tals exhibit numerous attractive properties in addition to their ultra- 
high piezoelectricity, including an electro-optical coefficient y3, of 
220 pm V“, near-perfect light transmittance and enhanced birefrin- 
gence (Extended Data Table 1). Figure 3a shows photographs of the a.c.- 
and d.c.-poled samples: the a.c.-poled samples are clearly transparent. 
The light transmittance of the a.c.-poled sample is found to be very close 
to its theoretical limit and is much higher than that of the d.c.-poled 
sample, especially for the visible-light spectrum (Fig. 3b). The light 
with wavelengths below 400 nm is completely absorbed because of 
the optical absorption edge (-3.10 eV), which is similar to most oxygen 


Wavelength (nm) 


Fig. 3 | Comparison of the properties of a.c.-poled and conventional 
d.c.-poled [001]-oriented PMN-28PT crystals. a, Photograph of a.c- and 
d.c.-poled PMN-28PT crystals. The thicknesses of the crystals are 0.5 mm and 
1.8mm, respectively. b, Light transmittance of the a.c.- and d.c.-poled PMN- 
28PT crystals. c, d;,and birefringence of the a.c.- and d.c.-poled PMN-28PT 


octahedral perovskites*”*. The light absorption coefficient of the 
a.c.-poled sample is found to be almost zero at wavelengths longer 
than 400 nm, whereas the absorption coefficient of the d.c.-poled 
sample remains large and decreases monotonically with increasing 
wavelength (Extended Data Fig. 6). 

The birefringence of the a.c.-poled crystals is approximately one 
order of magnitude higher than that of their d.c.-poled counterparts 
(Fig. 3c). This difference is associated with the different domain struc- 
tures. The principle axis of the optical indicatrices of the domains 
on both sides of a 71° domain wall are perpendicular to each other 
on the (001) plane, resulting in cancellation of the birefringence as 
light travels through a 71° domain wall (see Methods section ‘PLM’). 
For a.c.-poled crystals, however, the birefringence is approximately 
equal to that of the intrinsic value of a single-domain rhombohedral 
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Fig. 4 | Phase-field simulations of the effect of domain size onthe 
polarization, free-energy density and properties of the PMN-28PT crystal. 
a, Two-dimensional schematic of a[001]-poled PMN-28PT with various 71° 
domain widths (D,,). ‘Inf’, no 71° domain walls are present in the system. The 
polar directions for different colours are depicted onthe right. b, Schematic of 
polar vectors around a 71° domain wall (marked by the green lineina). The 
colours of the arrows represent the angles between each polar vector and the 
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crystals. Tena.c.-poled and ten d.c.-poled samples are used for the 
characterization of d;3, whereas the birefringence is determined from seven 
points of thea.c.- and d.c.-poled samples. The red graded arrow indicates the 
increase in d,, and birefringence for PMN-PT crystals after a.c. poling. The error 
bars inc present the standard deviation of the corresponding data. 


PMN-28PT crystal because of the substantially decreased number of 
71° domain walls. 

The a.c.-poled crystals also exhibit a30% enhancement in d,, over 
the d.c.-poled crystals. This phenomenon was observed in many 
a.c.-poled relaxor-PT crystals” *°. To understand the mechanism for 
the enhanced piezoelectricity, we conducted phase-field simulations 
to calculate the piezoelectricity of PMN-28PT with different 71° domain 
sizes, as shown in Fig. 4a. The variation of polar vectors around a 71° 
domain wall is depicted in Fig. 4b. For an ideal rhombohedral domain, 
the polar vectors are along the (111) directions, and the angle between 
the polar vector and the [011] direction is 35.3°. However, the presence 
of 71° domain walls causes the polar vectors to rotate towards the [011] 
direction to minimize the polarization gradient and elastic energies 
associated with the polarization/strain discontinuities around the 
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[011] direction. D,, in this schematic is 64 nm.c, The average magnitude of the 
spontaneous polarization components P, ([100]), P, ([010]) and P, ([001]) ofa 
ferroelectric domain asa function of D,,. The red line describes the angle 
between the overall polarization and the [111] direction asa function of D,,. 

d, The average free-energy density of the system with respect to P,. The free- 
energy density at the stable state (that is, AP=0) is selected as the reference. 
e, Dielectric permittivity (€33/€9) as a function of D,,. f, d33as a function of D,,. 
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domain walls. With decreasing domain size, the impact of the domain 
walls on the polarization becomes more prominent, resulting ina larger 
deviation of the polarization from the (111) directions, as shown in 
Fig. 4c. The free-energy profile of the system is also affected by the 
variation in polarization. As shown in Fig. 4d, the average free-energy 
profile of the system as a function of AP, (the change in polarization 
Palong the zdirection; that is, the [001] direction) is flattened with the 
increase in the width of the 71° domains; that is, the curvature of 
the free-energy profile (8°G/@P2\yp.-o) decreases with increasing 71° 
domain width. Thus, the calculated €,, and d;, increase with enlarging 
71° domain widths, as shown in Fig. 4e, f. 

It should be noted that the observed enhancementin piezoelectricity 
with increasing domain size is not universal to all ferroelectric crystals. 
Itis expected that the piezoelectricity of ferroelectrics is also related to 
thesymmetry of the ferroelectric phase and domain configurations”. 
For example, in tetragonal BaTiO; (refs. °°) and high-temperature- 
poled PZN-PT crystals*?, where domain configurations are very different 
from PMN-28PT, the domain-size dependence of piezoelectricity shows 
an opposite trend to that observed here. 

For the purpose of practical applications, we also studied the tem- 
perature stability of the properties of a.c.-poled PMN-28PT crystals 
(Extended Data Fig. 7), which indicates that the temperature-dependent 
electromechanical behaviours of a.c.- and d.c.-poled samples are very 
similar. Of particular interest is the fact that at temperatures below the 
rhombohedral-tetragonal phase transition temperature (~95 °C), the 
domain structure remains essentially the same, and no depolarization 
behaviour is observed, indicating that a.c.-poled crystals can be used 
up to their respective phase transition temperatures. 

Insummary, we report a simple approach that uses a.c. electric fields 
to simultaneously achieve near-perfect transparency, enhanced piezo- 
electricity and birefringence in rhombohedral PMN-PT crystals. Such 
transparent crystals are expected to find a wide range of applications 
in coupled electro-optical-mechanical devices. Both experimental 
and simulated results demonstrate that the enhanced piezoelectricity 
of [001]-oriented rhombohedral PMN-PT crystals achieved through 
a.c.-poling is due to the increased domain size, in contrast to the long- 
standing belief that decreasing domain size always leads to higher 
piezoelectricity. 
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Methods 


Sample preparation 

To achieve high transparency in the rhombohedral PMN-PT crystals, we 
choose crystal compositions that avoid the presence of multiple phases, 
such as a mixture of rhombohedral, monoclinic and orthorhombic 
phases within an MPB region. In particular, we selected the 28% PT com- 
position to circumvent the MPB region (around 33% PT). The PMN-28PT 
single crystals were grown by a modified Bridgman method at Xi’an 
Jiaotong University. The crystals were oriented by an X-ray diffraction 
(XRD) method with the x, y and zaxes along the [100], [010] and [001] 
directions, respectively, and then cut to the required dimensions for 
different experiments. The thickness of the samples for the domain 
observations and birefringence measurements was 0.175 mm. For the 
other experiments, the thickness of the samples was in the range of 
0.5 mm to 4 mm. Vacuum-sputtered gold was applied to both (001) 
faces of the samples to act as electrodes. 

The a.c.- and d.c.-poling experiments were performed with a fer- 
roelectric test system (TF Analyzer 2000E, aixACCT) with a high-volt- 
age amplifier (TREK 610E). The samples were immersed in silicone oil 
during the poling process. For a.c. poling, a bipolar triangle wave was 
applied tothe samples witha frequency of 0.1-100 Hz, acycle number 
of 5-20, anamplitude of 5kVcm‘and a poling temperature of 20-60 °C 
(ref. **), We discovered that essentially all of the 71° domain walls could 
be effectively eliminated by using a.c. electric fields with a broad range 
of frequencies from 0.1to 100 Hz (see Extended Data Fig. 8). The cycle 
number was selected to be larger than 10, as the enhancement of die- 
lectric permittivity was found to saturate after 10 cycles, as shown in 
Extended Data Fig. 9. To minimize the fluctuations in dielectric and 
piezoelectric properties among different samples, the frequency of 
thea.c. electric field was selected to be below 10 Hz, as shown Extended 
Data Fig. 9. To minimize the influence of internal stresses generated 
during polishing and sputtering of PMN-28PT crystals, a thermal 
annealing process was applied to the PMN-28PT crystals before a.c. 
poling**. Specifically, the samples were first annealed at 300 °C for 
5 hand then slowly cooled to room temperature. The samples then 
remained at room temperature for 5-7 days until the dielectric loss 
was reduced from approximately 4—5% after the annealing to around 
2%. For the d.c. poling, a conventional poling process used for PMN-PT 
crystals was adopted, witha d.c. electric field amplitude of 5 kV cm™ 
and a dwelling time of 5 min. 

For the optical and XRD experiments, the gold electrodes were 
removed by a solution of potassium iodide and iodine (the mass ratio 
of KI:L,:H,O was 4:1:40) without affecting the polarization. The (001) 
surfaces were then carefully polished to optical quality using diamond 
polishing paste and decreasing the average grit size downto 0.05 pm. 
For the electro-optic measurements, a thin layer of gold film (-15 nm) 
was deposited onthe polished surfaces to act as electrodes. Silver leads 
were attached using conductive epoxy to apply a voltage. 

Here, we would like to note that there are two technical issues associ- 

ated with the idea of using crystals poled along the [111] direction and 
then cut along the [001] direction to achieve both a high d,, and light 
transparency rather than the a.c. poling employed in this work. 
(1) The limitations of crystal size and difficulty in achieving a single 
domain state by poling [111] oriented rhombohedral crystals. For 
example, if one needs a [001]-oriented crystal plate with a size of 
20 mm x 20 mm x 1mm (the practical size of piezoelectric materials 
for medical transducers is in the range of 20-60 mm), one first needs 
a [111]-oriented crystal with the size of ~30 x 30 x 30 mm>. It is almost 
impossible to pole a crystal witha thickness of 30 mmalong its sponta- 
neous polarization direction as the internal stresses that develop during 
poling induce severe cracking”. In addition, one cannot guarantee a 
homogeneous composition for the [001]-oriented crystal plates made 
using this process because of the composition segregation along the 
growth direction of Bridgman-grown PMN-PT crystal boules. 


(2) The instability of a single-domain state and the issue of depolariza- 
tion. Preparing the [111]-poled samples witha rotated d;, involves high- 
temperature processes, suchas attaching a crystal toa sample holder 
for cutting, heat generation during cutting and sputtering transparent 
electrodes (the temperature required for sputtering ITO electrodes is 
approximately 300-600 °C). One may argue that we can adopt care- 
ful, low-temperature processes and utilize low-temperature trans- 
parent electrodes (such as silver nanowires) to prepare the samples. 
However, based on our experience, this is a difficult task, as we need 
to use special binders and sample holders and drastically slow down 
the cutting speed of the cutting machine; the adhesion of electrodes 
would also be substantially affected by using silver nanowires in place 
of ITO. Even if all of these precautions and approaches are followed, 
the samples are still likely to be depolarized to some extent because of 
the instability of a single-domain state. In addition, the [001]-oriented 
samples made by this process cannot be re-poled if they are de-poled 
accidentally. This is due to the fact that we cannot recover the original 
[111]-oriented samples. 


Dielectric and piezoelectric measurements 

The d,, values were determined by a combination of a quasi-static d;, 
meter (ZJ-6A) and an electric-field-induced strain measurement. The 
electric-field-induced strain was measured by a ferroelectric test sys- 
tem (TF Analyzer 2000E, aixACCT) with a laser interferometer (SIOS 
SP-S 120E). The dielectric permittivity was measured using an LCR 
meter (E4980A, KEYSIGHT technologies). To study the temperature 
stability of the properties, the d,, and k,, values of PMN-PT crystals 
are determined by the resonance method according to IEEE standard. 


Optical transmittance measurements 
Transmission spectra were measured with an ultraviolet-visible-infra- 
red spectrophotometer (JASCO V-570) at wavelengths ranging from 
300 to 2,500 nm. The incident light was set to transmit through the 
crystal along the poling direction, which was perpendicular to the 
(001) surface. 

According to the Fresnel equations, the reflection loss at two faces 
of the crystal plate was calculated from 450 to 850 nm through: 


_(n-1P 
n+l 


(1) 


where nis the wavelength-dependent index of refraction, calculated 
from the Sellmeier equation for a PMN-28PT single crystal given in 
refs. °°”, 

The effective loss coefficient @.,, a combination of the scattering 
coefficient x and the absorption coefficient @ (a= + a), was calculated 
using the transmission data from samples of different thicknesses: 


__ In(7,/N) 
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where 7, and 7, are the transmittances of the two samples with thick- 
nesses ¢, and t,, respectively. 


Optical domain characterizations 

PLM. The domain patterns and their extinction behaviours were ob- 
served using a PLM with a 0°/90° crossed polarizer/analyser pair 
(OLYMPUS BXS1). The optical retardation was measured using a thick 
Berek compensator (OLYMPUS U-CTB, ~0-10A) and an interference 
filter (IF546; wavelength, A = 546.1 nm). The birefringence was calcu- 
lated as the ratio of the retardation to the sample thickness. In the fol- 
lowing, we would like to explain the cancellation effect of birefringence 
as light travels across a 71° domain wall. For a single-domain rhombo- 
hedral ferroelectric, the parameters An, and An, are the birefractive 
indices of two different domains on both sides of a 71° domain wall. 
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Because the principal axes of the optical indicatrices of the domains 
on both sides of a 71° domain wall (for example, the domains with polar 
vectors along [111] and [111] directions) are perpendicular to each 
other onthe (001) plane, the relation between An, and An, is An,=—An,. 
Thus as the light travels across a 71° domain wall, the measured bire- 
fringence can be expressed by the following equation: 


d,- d, 
d 


AL An,d,+ An,d, 
d= a+d, 


An* = An; 


(3) 


where d, and d, are the lengths of optical path within the [111] and [111] 
domains, respectively, AL is the optical retardation caused by birefrin- 
gence, anddisasum of d,and d.. It is obvious that the measured bire- 
fringence will be quite small if the values of d, and d, are similar. 


BIM. The domain orientation measurements were performed using BIM 
equipment (Metripol, Oxford Cryosystems). Monochromatic light witha 
wavelength of 590 nm was used as the light source. A quarter-wave plate 
and a polaroid (P,) were placed at a 45° position to produce circularly 
polarized light. The circularly polarized light was converted to elliptical 
polarization after passing through an optically anisotropic specimen. 
The light then transmits through a linear analyser (P,) rotating about 
the microscope axis at a frequency of w. Finally, the intensity measured 
by the charge-coupled device (CCD) cameraasa function of wis given by 


[= loll + sin[2(wt- @)]sind} (4) 


where Cis the time, g is the angle between the horizontal direction and 
the principal axis of the optical indicatrix, dis the phase shift introduced 
to the light rays passing through the anisotropic sample witha certain 
thickness, and /, is the intensity of the unpolarized light. After rotating 
the analyser ten times, itis possible to obtain the intensity of each pixel 
onthe recorded image, refine the /,, |sin6| and @ values, and construct 
the false-colour images. 


High-resolution XRD experiments 

High-resolution, single-crystal XRD experiments were carried out to 
analyse the {222} Bragg reflections for (001)-oriented PMN-PT crys- 
tals. A high-resolution diffractometer (PANalytical X’Pert Pro MRD), 
equipped with Cu Ka, radiation, a hybrid mirror monochromator, an 
open Eulerian cradle and a solid-state PIXcel detector, was used for 
a precise two-dimensional 26-w scan of the {222} Bragg peaks. The 
reciprocal space maps were collected with step sizes of 0.004° in w 
and 0.004° in 20. The intensity of the patterns was accumulated along 
the w or 20 directions and then fitted to the pseudo-Voigt function: 


x-a ‘ 
2 
z: } (5) 


where a, is the intensity of the peak, a, is the position of the peak, a, is 
the mixing parameter of the Gaussian and Lorentzian profiles, and a, 
is proportional to the full-width at half-maximum (FWHM), whichcan 
be calculated using the equation, 
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FWHM = 2,/In2a; (6) 


Electro-optic measurements 

The electro-optic coefficients of the samples were measured using a 
modified Mach-Zehnder interferometer. The light source was a 632. 
8-nm He-Ne laser. For the longitudinal mode, the light beam and the 
applied electric field were both parallel to the poling direction. In this 


mode, the longitudinal effective linear electro-optic coefficient 
Ve was measured, where the subscript c indicates that the coefficient 
is acomposite of several electro-optic coefficients in the standard 
coordinate system (for example, 7,, and r,;). 

For the transverse mode, the light beam travels along the [110] direc- 
tion, and the electric field was applied along the [001] poling direction 
of the sample. The linear electro-optic coefficients y;, and y;, were 
measured when the polarization directions of the laser beam were 
perpendicular and parallel to the poling direction, respectively. The 
transverse effective linear electro-optic coefficient yr) was calculated 
by the equation: 


Ve =V 37 Meviy/ne (7) 


The refractive indices were calculated from the data of single-domain 
crystals*°*”. The asterisk indicates that the measured electro-optic 
coefficient is a combination of the inverse piezoelectric effect and the 
intrinsic electro-optic effect. 


Phase-field simulations 
The domain evolution and piezoelectric responses were obtained by 
performing phase-field simulations. A domain structure is described by 
the spatial distribution of the ferroelectric polarization P. The temporal 
evolution of the polarization and thus the domain structure is described 
by the time-dependent Ginzburgh-Landau equation: 

oP F(P) 

ar OP 8) 
where fis the time, Fis the total free energy and L is the kinetic coef- 
ficient. The total free energy contains contributions from the bulk, 
elastic, electric and gradient energies. Details of how the phase-field 
method can be used to simulate the switching behaviours of ferro- 
electric single crystals can be found in refs. °°’. The Landau coeffi- 
cients were adapted from ref. *° using the experimental ferroelectric, 
piezoelectric and dielectric properties of PMN-28PT crystals at room 
temperature. On the basis of this thermodynamic potential, the calcu- 
lated equilibrium phase is rhombohedral at room temperature witha 
spontaneous polarization of -0.38 Cm”, a relative dielectric constant 
of ~5,500 along [001] and a longitudinal piezoelectric coefficient of 
~1,850 pC N ‘along [001], which are in reasonable agreement with our 
experiments. The electrostrictive coefficients measured by Li et al." 
for PMN-28PT were adopted. The gradient energy coefficients were 
assumed to be isotropic, and the domain wall width was assumed to 
be ~2nm. It should be noted here that the Landau potential used in this 
work represents the average free energy of a single-domain PMN-28PT 
crystal, which incorporates the impacts of the nanoscale heterogeneous 
polar regions (several nanometres) in the free energy and electrome- 
chanical properties”. 

To simulate the a.c. and d.c. poling processes, we first obtained 
an unpoled domain structure from a random noise distribu- 
tion of the polarization within a quasi-two-dimensional grid with 
512Ax x 512Ax x 1Ax grid points (Ax=1nm). A low-frequency triangle 
wave was then applied to mimic the a.c. poling, whereas a single-step 
square wave was used to represent the d.c. poling. The magnitude of 
the poling electric field was 10 kV cm. We performed phase-field simu- 
lations at different mechanical boundary conditions: the stress-free 
condition, which assumes the averaged stress of the simulated system 
is zero; andthe clamped condition, which assumes the averaged strain 
is zero. Figure 1 shows the simulated results for the clamped condition, 
and the simulation results under stress-free conditions are presented 
in Extended Data Fig. 3b. The practical condition in the experiments 
is probably between these two extreme mechanical conditions. The 
conclusion that the sizes of the 71° domains in a.c.-poled samples are 
always much larger than the d.c.-poled ones holds for both mechanical 
boundary conditions, while the 109° domain layer thickness is similar 


in botha.c.- and d.c.-poled samples. It is interesting to note that the a.c. 
poling can lead toacompletely layered 109° domain structure without 
71° domain walls under the stress-free condition. 

The ¢€;; and d;; values were obtained by evaluating the variations 
of polarization and longitudinal strain under a small electric field of 
0.5kV cm ‘along the [001] direction. We also calculate the free energy 
by applying small test electric fields (E, is from 0 to 1kV cm“, which is 
sufficiently small to avoid domain wall motion or phase transitions) 
and plot the average free-energy density as a function of the change 
of P, along the poling direction. 

The computer simulations were performed using the commercial 
software package p-PRO (http://mupro.co/contact/) on the ICS-ACI 
Computing Systems at Pennsylvania State University and at the Extreme 
Science and Engineering Discovery Environment cluster, which used 
the Bridges system at the Pittsburgh Supercomputing Center**“*. 


Data availability 


The data that support the findings of this study are available on request 
from the corresponding authors. 


34. Li, F. et al. Transparent ferroelectric crystals with ultrahigh piezoelectricity. Provisional US 
Patent 16/541,309 (2019). 

35. Li, F., Wang, L., Jin, L., Xu, Z. & Zhang, S. J. Achieving single domain relaxor-PT crystals by 
high temperature poling. CrystEngComm 16, 2892-2897 (2014). 

36. Wan, X. M., Chan, H. L. W., Choy, C.L., Zhao, X. & Luo, H. Optical properties of (1-x) 
Pb(MgjgNi2/3)O3-XPbTIO; single crystals studied by spectroscopic ellipsometry. J. Appl. 
Phys. 96, 1387-1391 (2004). 

37. Wan, X. M. et al. Refractive indices and linear electro-optic properties of (1-x) 
Pb(Mgj3Ni2/3)O3-XPbTIO; single crystals. Appl. Phys. Lett. 85, 5233-5235 (2004). 

38. Wang, J., Shi, S.Q., Chen, L.-Q., Li, Y. & Zhang, T. Y. Phase-field simulations of ferroelectric/ 
ferroelastic polarization switching. Acta Mater. 52, 749-764 (2004). 

39. Wang, J., Wang, B. & Chen, L.-Q., Understanding, predicting, and designing ferroelectric 
domain structures and switching guided by the phase-field method. Annu. Rev. Mater. 
Res. 49, 127-152 (2019). 

40. Khakpash, N., Khassaf, H., Rossetti, G. A. Jr & Alpay, S. P. Misfit strain phase diagrams of 
epitaxial PMN-PT films. Appl. Phys. Lett. 106, 082905 (2015). 


Al. Li,F., Jin, L., Xu, Z., Wang, D. & Zhang, S. J. Electrostrictive effect in Pb(Mg13Nb»/3)O-x 
PbTiO, crystals. Appl. Phys. Lett. 102, 152910 (2013). 

42. Li, F. et al. The origin of ultrahigh piezoelectricity in relaxor-ferroelectric solid solution 
crystals. Nat. Commun. 7, 13807 (2016). 

43. Towns, J. et al. XSEDE: accelerating scientific discovery. Comput. Sci. Eng. 16, 62-74 
(2014). 

44. Nystrom,N.A., Levine, M. J., Roskies, R. Z. & Scott, J. R. Bridges: a uniquely flexible HPC 
resource for new communities and data analytics. In Proc. 2015 Annual Conference on 
Extreme Science and Engineering Discovery Environment (ACM, 2015); https://doi. 
org/10.1145/2792745.2792775. 


Acknowledgements F.L. and Z.X. acknowledge the support of the National Natural Science 
Foundation of China (grant numbers 51922083, 51831010 and 51761145024), the development 
programme of Shaanxi province (grant number 2019ZDLGY04-09) and the 111 Project 
(B14040). BW. and L.Q.C. acknowledge the support of the US National Science Foundation 
under the grant number DMR-1744213 and Materials Research Science and Engineering Center 
(MRSEC) grant number DMR-1420620. N.Z. thanks the NSFC for support under grant number 
1604123. The computer simulations were performed on the ICS-ACI Computing Systems at 
Pennsylvania State University through the Penn State Institute for Cyber Science and at the 
Extreme Science and Engineering Discovery Environment cluster supported by National 
Science Foundation grant number ACI-1548562; the Bridges system was used, which is 
supported by NSF award number ACI-1445606 at the Pittsburgh Supercomputing Centre 
under the allocation DMR170006. S.Z. thanks the ONRG (grant number N62909-18-12168) and 
ARC (grant number FT140100698) for support. T.R.S. was supported by the US ONR. 


Author contributions The work was conceived and designed by S.Z., L.Q.C. and F.L. C.Q. 
performed the piezoelectric and optical experiments. F.L. and Z.X. supervised the 
piezoelectric and dielectric measurements. N.Z. and F.L. supervised the optical experiments. 
B.W. performed the phase-field simulations and discussed with F.L. L.Q.C. supervised the 
simulation work. J.L. assisted with the piezoelectric measurements. N.Z. and DW. performed 
XRD experiments. Y.W. and H/T. assisted with the optical measurements. F.L. drafted the 
manuscript, S.Z., N.Z., L.Q.C. and T.R.S. revised the manuscript and all authors discussed the 
results. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019- 
1891-y. 

Correspondence and requests for materials should be addressed to Z.X., L.-Q.C. or F.L. 
Peer review information Nature thanks Wook Jo and the other, anonymous, reviewer(s) for 
their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


The principal axes of 


a : ccemeign 
<—Ty the optical indicatrix ! 
= Ss 
109° domain wall 
1 
‘ ¥ 
3 001 
3 (001) 
— 1 
N ae =a 
! 
Y // [010], Z 
b i] 
! 
> SS 
Fal 
pa 5 
° i] 
m2 (001) 
N axe aD 


Y // [010], 


domain wall. Thus, n, and n, do not change as the light crosses a109° domain 
wall, resulting in the absence of light scattering and/or reflection. In contrast, 
the principal axis of the optical indicatrix projected on the (001) plane rotates 
by 90° as the light travels across a 71° domain wall, resulting in the alternating 
of refractive indices n, and n,. This is the reason that 71° domain walls scatter 


Extended Data Fig. 1| Schematic diagrams of the projections of optical 
indicatrices on both sides of adomain wall. a, 109° domain wall; b, 71° domain 
wall. The prerequisite for the light scattering and reflection at an interfaceis 
the difference in refractive indices between the optical media on each side of 
the interface. As shown here, the principal axes of the optical indicatrix 
projected on the (001) plane are the same for domains on each side of a109° and/or reflect the light. 
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Extended Data Fig. 2| The variation in various energies of PMN-28PT during the first four cycles ofa.c. poling. The data are obtained by phase-field 


simulations. Here we plot the normalized energies, which are dimensionless. 
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Extended Data Fig. 3 | Effects of system size and mechanical boundary conditions on thea.c.- and d.c.-poled domain structures from phase-field simulations. 
a, Effects of system size (at the clamped boundary condition). b, Effects of the mechanical boundary condition (the scale of the simulation is 512 x 512 nm). 
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Extended Data Fig. 4| Characterization of the domain size along the poling 
direction for [001]-oriented rhombohedral PMN-PT crystals. a, PLMimages 
on the (100) face for [001]-poled PMN-28PT crystals under a.c. poling. b, PLM 
images on the (100) face for [001]-poled PMN-28PT crystals under d.c. poling. 
c, Distribution of the thickness of laminar domains for the a.c.-poled sample 
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measured froma. d, Distribution of the thickness of laminar domains for the 
d.c.-poled sample measured from b. The thickness of the samples is around 
100 um. As shown in this figure, no clear difference is observed for the 
thickness of laminar domains betweena.c.- and d.c.-poled samples. 
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Extended Data Fig. 5 | Pseudo-Voight fittings of the {222} diffraction peaks at high-20. a,b, The unpoled sample. c,d, The d.c.-poled sample. e,f, The a.c.-poled 
sample.a,c,e, The intensity of the patterns accumulated along 0. b,d,f, The intensity of the patterns accumulated along 20. 
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Extended Data Fig. 6 | Effective light absorption coefficients of a.c.- and d.c.- 
poled PMN-28PT crystals. 


Article 


a dy, 
Sp 16000 
—— AC-poled 
—e— DC-poled 14000 
ead | 12000 
Zz 
Oo 
Q 2400+ 2 10000 
= 8 
ise] 
© 20001 se 
6000 
1600 + } 
4000 
20-10 0 10 20 30 40 50 60 
b Temperature (°C) 


+p [010] 25°C 


A [100] 


AC-poled 


Extended Data Fig. 7| The temperature stability ofa.c.- and d.c.-poled PMN- 
28PT crystals. a, d,3, €3;'/€) and k,3 as a function of temperature. The dwelling 
time for each testing temperature is 10 min. b, Domain structure of a.c.-poled 
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PMN-28PT crystals as a function of temperature. The temperature is increased 
from 25to100 °Cata rate of 1 °C min”. The dwelling time for each testing 
temperatureis 30 min. 
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Extended Data Fig. 8| PLMimages of [001]-oriented PMN-28PTcrystalswith g-i, Images obtained fromthe crystals poled by ad.c. electric field. 

different poling conditions. a—d, Images obtained from the crystals poled by g,i, Magnifications of the two regions circled inh. The thickness of the samples 
ana.c. electric field at frequencies of 0.1Hz (a), 1Hz (b), 10 Hz (c) and 100 Hz (d), is 0.5mm. There are numerous cross-like domain walls in d.c.-poled samples, 
respectively. e, f, Enlargements of the regions in b andc, respectively. whereas these types of domain wall are removed by ana.c. electric field. 
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Extended Data Fig. 9 | Dielectric permittivity and piezoelectric coefficient 
of thea.c.-poled PMN-28PT crystal as a function of the cycle number and 
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Extended Data Table 1| Electromechanical and electro-optical properties of [001]-oriented PMN-28PT crystals via a.c.- and 
d.c.-poling 


Properties of PMN-28PT DC-poled = AC-poled 
Free dielectric permittivity (¢33'/e0) 58004120 7800+230 
Clamped dielectric permittivity 780+50 840+60 
Dielectric loss at 1 kHz (%) 0.4840.02  0.30+40.04 
Piezoelectric coefficient (pC N") 1670430 2190+40 
Electromechanical coupling factor k33 0.9340.01  0.94+0.01 
Phase transition temperature Ter (°C) 93+1 93+1 
Curie temperature Tc (°C) 13242 13242 
Birefringence (107°) 1.8940.55 23.740.7 
Effective electro-optical coefficient y3;, (pm V') @633nm light beam//[110] - 227423 
Effective electro-optical coefficient 7:3, (pm V"') @633nm light beam//[110] - 115+11 
Effective electro-optical coefficient y."" (pm V"') @633nm light beam//[001] - 153+20 


To measure yt*, both the applied electric field and the light beam are along the [001] direction. To measure coefficients v3, and 7;;, the applied electric field is along the [001] direction, and the 
light beam is along the [110] direction. Note: it is difficult to obtain accurate electro-optical coefficients for d.c.-poled samples owing to the strong light scattering by the high density of 
ferroelectric domain walls. The errors indicate the s.d. of the corresponding data (more than five samples/data points are used for each measurement). 
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native material? 


Template-directed assembly has been shown to yield a broad diversity of highly 
ordered mesostructures’”, which in a few cases exhibit symmetries not present inthe 
. However, this technique has not yet been applied to eutectic 


materials, which underpin many modern technologies ranging from high- 
performance turbine blades to solder alloys. Here we use directional solidification ofa 
simple AgCI-KCl lamellar eutectic material within a pillar template to show that 
interactions of the material with the template lead to the emergence of a set of 
microstructures that are distinct from the eutectic’s native lamellar structure and the 
template’s hexagonal lattice structure. By modifying the solidification rate of this 
material-template system, trefoil, quatrefoil, cinquefoil and hexafoil mesostructures 
with submicrometre-size features are realized. Phase-field simulations suggest that 
these mesostructures appear owing to constraints imposed on diffusion by the 
hexagonally arrayed pillar template. We note that the trefoil and hexafoil patterns 
resemble Archimedean honeycomb and square-hexagonal-dodecagonal lattices®, 
respectively. We also find that by using monolayer colloidal crystals as templates, a 
variety of eutectic mesostructures including trefoil and hexafoil are observed, the 
former resembling the Archimedean kagome lattice. Potential emerging applications 
for the structures provided by templated eutectics include non-reciprocal 
metasurfaces’, magnetic spin-ice systems®”, and micro- and nano-lattices with 
enhanced mechanical properties”. 


Eutectic materials have been considered for use as optical metamateri- 
als”? and plasmonic structures“, and for mechanical® and energy-har- 
vesting” applications. During solidification, the constituent species in 
the molten eutectic diffuse perpendicular to the solidification direction 
near the solidification front. The kinetics of this mass transport inthe 
liquid just ahead of the solidification front drives arrangement of the 
solid phases in patterns with generally consistent length scales that are 
roughly inversely proportional to the square root of the solidification 
velocity”. Decades of research on solidification of binary and ternary 
eutectic systems indicate that only alimited set of regular mesostruc- 
tures emerge, even over abroad range of processing conditions”. By 
engineering the heat removal, for example, the microstructure could 
be modified from straight to curved lamellae”, or from lamellar to 
rod-like’; however, the resulting microstructures are still quite similar 
to those found in the native eutectic. As we describe here, this limita- 
tion can be overcome by introducing a template phase into which the 
eutectic melt is solidified. Although the introduction of atemplate has 
been demonstrated to drive the self-assembly of organic molecules” 
into highly ordered patterns!”>”° and new symmetries? >”””’, such an 
approach has not been applied to any eutectic system. As we previously 
reviewed”®, only now is the promise of the effects of atemplate ona 
solidifying eutectic for forming new and complex eutectic mesostruc- 
tures beginning to be exploited)”. 


Here, starting with the well-studied AgCI-KCl eutectic” * (see Meth- 
ods for details), we examine howa template affects what would other- 
wise be a regular lamellar microstructure (see the scanning electron 
microscopy (SEM) image in Fig. 1a). The templates used in this work 
consist of ahexagonal lattice of Ni pillars that are 4-6 pm tall and SOO- 
620 nmin diameter (see Fig. 1b and Methods for details), with edge gaps, 
g (defined in the inset of Fig. 1b), of 160-290 nm. Pillar diameters andg 
were selected to be comparable to the accessible range of the average 
lamellar spacing, A (defined in the inset of Fig. 1a), inthe AgCI-KCI lamel- 
lar eutectic. Outside the template, a lamellar structure emerges with 
aA that depends on the solidification rate (see Extended Data Fig. 1a), 
as expected. However, inside the pillar template, there is aremarkable 
transition, and abroad array of solidification-rate-dependent complex 
mesostructures appear. When / is commensurate with the edge gaps 
inthetemplate, the pillars modify the phase separation of the eutectic 
such that spoke-like patterns in AgCl and KCl are realized instead of the 
regular lamellar structure. We designate the resultant structures as 
trefoil (see Fig. 1c), quatrefoil (see Fig. le), cinquefoil (see Fig. 1f) and 
hexafoil (see Fig. 1g), based on the number of KCI spokes per unit cell 
of the template (see schematic in Extended Data Fig. 2). Remarkably, 
the trefoil pattern resembles the Archimedean honeycomb lattice 
(see schematic in Fig. 1d) of roughly hexagonally shaped AgCl and KCI 
domains. In contrast, the hexafoil pattern resembles the Archimedean 
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Fig. 1| Selected microstructures formed by template-directed eutectic 
solidification. a, SEM image of AgCl (bright)-KCI (dark) eutectic solidified ata 
cooling rate of 22 °C min”. Average / as defined in the inset is 420 nm. The 
solidification direction is out of the image (z axis), as indicated by the red 
dotted circle. b, Plan-view SEM image of a pillar template sample showing the 
hexagonal arrangement of pillars; here g=220 nm, as defined in the inset. 
c-h, SEMimages of microstructures, with lattice schematics shown for 
selected cases.c, Trefoil, e, quatrefoil, f, cinquefoil and g, hexafoil patterns 


square (KCI)—-hexagonal (pillars)-dodecagonal (AgCl) lattice (the SHD 
lattice, see schematic in Fig. 1h). 

Pillar templates were infilled by molten AgCI-KCl eutectic that 
was directionally solidified at various rates (see Extended Data 
Fig. 1b and Methods for details), leading to different values of g/A (A 
determined outside the template region; see Extended Data Fig. Ic). 
When g/A < 0.25, a disordered pattern (termed no-foil) persists (see 
Extended Data Fig. 3a—d). Uponincreasing the solidification rate such 
that 0.4 <g/A< 0.75, trefoil patterns were formed (see Extended Data 
Fig. 3f-i). In the transition region when g/A = 0.3, a mixed, predomi- 
nantly trefoil, structure is present (see Extended Data Fig. 3e). For 
g/A= 0.95, quatrefoil patterns dominate (see Extended Data Fig. 4a, 
b). When g/A = 1, the cinquefoil pattern appears (see Extended Data 
Fig. 4c), and for g/A=1.05, the hexafoil pattern is obtained (see Extended 
Data Fig. 4d-f). In Extended Data Fig. 5 we include quantitative image 
analysis (see Methods for details) of the local and long-range order in 
these patterns. The cross-sectional view of these spoke-like patterns 
shows a tilted alignment of the eutectic phase boundaries (Extended 
Data Fig. 6a). Thermal simulations based on our experimental setup 
suggest that the solidification front within the bulk of the template 
would be slanted (Extended Data Fig. 6b) owing to the nonplanar geom- 
etry of the sample (see Methods for details). In experiments, there may 
be other factors that affect the orientation of the solidification front, 
including inhomogeneities in thermal properties. These factors are 
ignored inthe phase-field simulations, which assume a linear thermal 
gradient parallel to the pillar axis, leading to a generally planar solidifi- 
cation front perpendicular to the pillar axis; additionally, differences 
inthe thermal conductivities of the eutectic phases and pillars are not 
considered (see Methods for details). Despite these simplifications, 
the phase-field simulations (Fig. 2) produce structures that match the 
experimentally observed structure of the eutectic (Fig. 1). 

In phase-field simulations (see Methods for details), when g/A=0.163, 
Ais substantially larger than the edge gap (g=220 nm) of the template 
and the template does not impose order onthe eutectic pattern, result- 
ing in ano-foil structure (see Fig. 2c, j). For increased solidification 
rates, the simulations predict the trefoil pattern for 0.325 <g/A< 0.651 
(see Fig. 2d-f, k-m), the cinquefoil pattern for g/A = 0.814 (see Fig. 2g, 
n), and the hexafoil pattern for g/A = 0.976 (see Fig. 2h, i, o, p). We note 
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with 3, 4,5 and 6 KCl spokes per unit cell of the template (see schematicin 
Extended Data Fig. 2), respectively, were obtained by varying the solidification 
conditions. d, Schematic of the Archimedean honeycomb lattice (for 

c);h, schematic of the Archimedean SHD lattice (for g). Parts of the SEM images 
(inc and e-g) are false-coloured, with AgCl as yellow, KCl as blue and Ni pillars 
as black. Uncoloured SEM images are displayed in Extended Data 

Figs. 3g, 4b,c,d. Allscale bars, 1pm. 


that phase-field simulations suggest that it is necessary to have the 
solidification direction parallel to the template pillar axis (that is, along 
the zaxis; see Fig. 2a, b), rather than perpendicular to it, for these highly 
ordered patterns to emerge. These results reveal that at the solidifica- 
tion front within the template, the pillars disrupt the natural edgewise 
diffusion of the lamellar eutectic by compelling the diffusion fields 
to obey constraints set by the template geometry. To maintain the 
requirement of consistent diffusion path lengths” within this modi- 
fied diffusion field, the eutectic solidifies in spoke-like patterns while 
preserving the overall hexagonal symmetry imposed by the template. 
The patterns observed for various g/A in experiments and phase-field 
simulations are mapped in Fig. 3. This map suggests that certain ranges 
of lamellar spacings and template edge gaps will result in a single type 
of spoke-like pattern, a useful finding for setting the parameters to 
achieve a specific mesostructure. While some experimentally observed 
patterns were not observed inthe simulations, this is probably due to 
enforcement of periodic boundary conditions and the use of adomain 
size of one template unit cell, which constrains the system and pre- 
vents the emergence of asymmetric patterns or patterns with larger 
periodicities. Simulations were also performed for g= 440 nm (see 
Extended Data Fig. 7), showing the possibility of obtaining additional 
patterns by changing the pillar diameter as well as g/A. The effects of the 
shape of the pillars (circular and oval) were also investigated via phase- 
field modelling (see Extended Data Fig. 8). The eutectic maintains the 
overall pattern expected for a hexagonal arrangement of pillars, but 
the morphologies of individual phases are strongly dependent onthe 
exact shape of those pillars, showing the versatility of this process. 
Building on these findings, template-directed eutectic solidification 
was also explored within a monolayer silica colloidal crystal in which 
spherical silica colloidal particles are assembled into atwo-dimensional 
hexagonally close-packed structure (see Fig. 4a, b, and Extended Data 
Fig. 9a). The sphere diameter, d, was 560 nm (comparable to the pillars). 
Ata solidification rate where A = d, the eutectic pattern organized into 
atrefoil pattern (see Fig. 4c), while a hexafoil pattern (see Fig. 4d) was 
observed when <d. As expected, when the solidification rate was slow 
such that A > d, the colloidal crystal template imposes no order on the 
eutectic pattern, resulting ina disordered structure (see Extended Data 
Fig. 9b). The trefoil pattern in this template resembles the Archimedean 


e@4=1,352nm dA=676nm edA=450.67nm f 4=338nm gig 270.4 nm fh i - 395.33 nm i 2 = 193.14 nm 


Fig. 2| Patterns observed in phase-field simulations. a, b, Three-dimensional 
views of the phase-field simulation domain showing the evolution of the initial 
lamellar seed (a; cross-section shown in h) into the hexafoil structure (b; cross- 
section shownino). The solidification direction ina and bis along thezaxis. 
c-i, Images of the initial conditions of simulations performed using the given 
lamellar spacing. j—p, Images showing the corresponding steady-state 


kagome lattice (see schematic overlaid on Fig. 4c), approximating AgCl 
and KClas triangles and silica colloids as hexagons. These results sug- 
gest that such patterns can be obtained not only from pillar templates, 
but also from other two-dimensionally arranged obstacles. Owing to 
the planar geometry of the sample, the solidification front appears to 
remain parallel to the substrate throughout the template (see Extended 
Data Fig. 9c, d). 

The realization of complex mesostructures from solidification 
of asimple system consisting of a lamellar eutectic in a hexagonally 
arranged template indicates that the complex thermal and diffusion 
landscapes imposed by a template on a solidifying eutectic can lead 
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Fig. 3 | Mesostructure phase map. Map of experimentally observed (red) and 
phase-field simulated (black) mesostructures (shown onthe vertical axis) asa 
function of g/A. The plotted data are displayed in Fig. 2 and in Extended Data 
Figs.3. and 4. 
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patterns:j, no-foil; k-m, trefoil; n, cinquefoil; 0, p, hexafoil. The solidification 
direction inc-—pis out of the image (z axis), as indicated by the red dotted 
circles. The images in c-p show the x-y plane cross-section of the simulation 
domain repeated once in each direction. The template pillars are displayed as 
semitransparent grey inaandb, andas black in c-p, while AgCl is shown as 
yellow, and KClas blue. All scale bars, 1 um. 


to intricate and rather unexpected patterns. Given the classes of pat- 
terns observed, we suggest it would be interesting to consider whether 
templates could modify the structure of metal-dielectric eutectics (for 
example, Ge-Al and Cu-Cu,P) to form non-reciprocal metasurfaces’, 
Co-based eutectics to form magnetic spin-ice systems®’, and mechani- 


cally robust eutectics such as NiAl-Mo or Al,O;-ZrO, to form mechanical 
microlattices”. It is important to keep in mind that the template can 
provide additional functionality (for example, being formed froma 
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Fig. 4| Pattern formation by a monolayer colloidal crystal template. a, SEM 
image of the monolayer silica colloidal crystal (sphere diameter d=560 nm) 
template. b, Schematic of eutectic-infilled monolayer colloidal crystal 
template (see Methods for details). c, SEM image of the trefoil pattern (a 
schematic of the Archimedean kagome lattice is overlaid on the image at 
bottom left) obtained whenA=d(A=500 nm). d, SEM image of the hexafoil 
pattern obtained whendA<d(A=160 nm). The solidification direction inc and 
dis into the image (z axis), as indicated by the black crossed circle. All scale 
bars,1pm. 


Nature | Vol577 | 16 January 2020 | 357 


Article 


diamagnetic material), and also that in some systems it is possible for 
one of the eutectic phases to be selectively etched and replaced with 
a different material”. 
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Methods 


AgCI-KCl eutectic synthesis 

As-received AgCl (99.999%, Sigma Aldrich) and KCl (99.99%, Sigma 
Aldrich) were mixed as per the binary eutectic composition™ of 
81.77 wt% AgCl and 18.23 wt% KCI, and melted in a glass vial (covered 
with aluminium foil to exclude light) ona hotplate at 470 °C for 1h, and 
subsequently cooled to room temperature. 


Fabrication of pillar templates 

Under dark-room conditions, SU-8 (MicroChem, SU-8 2000 
series) photoresist was mixed with 0.5 wt% photo-initiator, 
cyclopentadienyl(fluorene)iron(I1) hexafluorophosphate (Aldrich). 
Indium tin oxide coated glass substrates (with a sheet resistance of 
70-100 Q; Delta Technologies Ltd) were cleaned with acetone and 
isopropanol, followed by O, reactive ion etching (RIE) at100 W for5 min 
(with an O, flow of 19.6 cm? min‘“‘at STP at 10 mtorr chamber pressure; 
Plasma-Therm 790 MF RIE Plasma System). Immediately after RIE, the 
SU-8 solution was spin-coated (2,000 r.p.m. for 30s) on the substrates. 
These SU8 layers were soft baked on hotplates first at 65 °C for 10 min 
and then at 90 °C for 20 min, followed by slow cooling. A laser interfer- 
ence lithography setup* (laser output of 5.0 W, frequency-doubled 
Nd:YVO, laser, 532 nm) was used to create a hexagonal pattern in the 
SU8 film, by focusing three identical beams (arranged at 120° to each 
other) onto the sample plane for 0.8 s. The parameters of this pattern 
(with a lattice constant of 760-810 nm, and diameter of 500-620 nm) 
were adjusted by controlling the exposure time and the incident angles 
of each beam. All the beams were linearly polarized in their incident 
planes. After exposure, the samples were oven-baked at 85 °C for 20 min 
in dry air followed by slow cooling. The post-baked samples were then 
developed in propylene glycol monomethyl ether acetate for 4 h, fol- 
lowed by rinsing in an isopropanol bath for 10 min. The samples were 
gently blow dried with N, and then baked at 90 °C for 15 min. To cre- 
ate the pillar template, Ni electrodeposition was carried out within the 
SU8 scaffold. The electrodeposition was performed potentiostatically 
at -1.8 Vina commercial sulfamate nickel plating solution (Transene, 
SN-10). A drop of isopropanol was put on the scaffold before dipping it 
inthe electroplating solution to improve the infiltration of the solution 
into the hydrophobic scaffold. By controlling the electrodeposition 
time, we were able to control the pillar height. The SU8 scaffold was then 
etched using O, plasma RIE (Nordson March RIE Plasma System) with 
500 mtorr pressure and 200 W power for 15 min, thus exposing the Ni 
pillars. Subsequently, the exposed pillars were surface-protected with 
aconformal coating of -25 nm of alumina using atomic layer deposition 
(300 cycles of the standard recipe; Savannah S100 Cambridge Nanotech). 
The usable spot size of the pillar template was about 2 mm in diameter. 


Eutectic solidified in pillar templates 

Directional solidification experiments were performed using two tech- 
niques, aLinkam THMS 600 hot-stage (with a TMS 94 controller) that 
sets a cooling rate, and a tube-furnace setup” where a syringe pump 
draws the samples out of the heated furnace at fixed speeds. The rate 
of solidification (controlled by either a set cooling rate or the draw 
rate) was varied to obtain a certain of the eutectic (see Extended Data 
Fig. 1a). With the hot-stage setup, the stage was first heated to 470 °C 
and then the sample was placed on it. For the tube-furnace setup, the 
temperature was first set to 550 °C, and then the samples were placed 
at the location where the temperature reached 470 °C. A small piece 
of the prepared eutectic (around 150 mg) was placed on top of the 
pillar templates (see schematic in Extended Data Fig. 1b), and heated 
to 470 °C for 8-10 min to enable complete infiltration of the molten 
eutectic. Partial infilling was obtained with the dwell time less than 
5min. Subsequently, the infilled samples were directionally solidified at 
various rates, such that the nominal direction of solidification was along 
the axis of the pillars, until the sample cooled to room temperature. 


Microstructure characterization 

Directionally solidified solid eutectic samples were peeled off fromthe 
substrate and their bottom surface was characterized (see schematic 
in Extended Data Fig. 1b) by scanning electron microscopy (SEM) using 
a Hitachi S4800-SEM. SEM images from all the samples are shown in 
Extended Data Figs. 3 and 4. Note that we did not coat the samples dur- 
ing the SEM visualization: to visualize the cross-section, the samples 
were mechanically cleaved. After SEM imaging, the samples were stored 
inan argon glove-box to minimize the corrosive reaction between the 
chloride salts and the exposed Ni pillars. 


Image analysis 

The average lamellar spacing values (from samples without pillar tem- 
plates) were determined by performing the fast Fourier transform 
function of ImageJ onthe SEM images. For templated eutectic samples, 
A was determined outside the template region (see Extended Data 
Fig. Ic), whereas g and the emergent patterns were determined from 
SEM images taken around the centre of the pillar template samples. 
Patterns were analysed using a custom MATLAB code based on ref. *°. 
The SEM images were first false-coloured to match the colour scheme 
of the simulated patterns, and the false-coloured SEM images were used 
in further analysis. These images were divided into unit cells and these 
unit cells were stacked together, followed by averaging the intensity 
at each pixel (shown in Extended Data Fig. 5). Moreover, the deviation 
between the simulated pattern and the SEM image was calculated for 
each unit cell and mapped over the entire SEM image. The unit cells 
having the lowest deviation are denoted in blue and those with the 
greatest deviation in red. Cells where the template pillars appeared 
damaged were not included in the analysis. 


Eutectic solidified in colloid template 

Silica microspheres (Fibre Optic Center Inc.) were dispersed in etha- 
nol (1% by weight). Using a vertical deposition method”, the 560 nm 
diameter silica colloids were deposited as a monolayer (2D colloidal 
crystal) and as multilayers (3D colloidal crystal) on (piranha cleaned) 
silicon substrates in an incubator (Isotemp 125D, Fisher) set at 50 °C. 
Multilayer regions were scratched off before the directional solidifica- 
tion experiments. The eutectic was side-infilled (as demonstrated in 
ref.*) into the template, using a hotplate set at 470 °C, and subsequently 
directionally solidified using the hot-stage setup. The patterns were 
imaged with top-view SEM. 


Thermal profile simulations 

Thetemperature profile in the eutectic during solidification was calcu- 
lated using COMSOL in order to evaluate the assumptions regarding the 
orientation of the solidification front. For the pillar template case, the 
heat equation was solved in atwo-dimensional axisymmetric domain 
consisting of (from bottom to top) an aluminium plate (1cm thick, 8cm 
in diameter, thermal conductivity («) =238 W mK”), aglass substrate 
(0.7 mm thick, 20 mm in diameter, k=1.38 W mK”), a pillar-eutectic 
composite layer (6 pm thick, 2mm in diameter, k=23.4 Wm7K7), and 
a eutectic overlayer (a truncated hemisphere with a base diameter 
of 5mm and a height of 2 mm). Unless otherwise noted, the thermal 
conductivity values were assumed from COMSOL’s default material 
library. The thermal conductivity of the pillar-eutectic composite 
(k= 23.4 W mK‘, as previously stated) was determined by weight- 
ing the thermal conductivity of each material by its volume fraction— 
namely, the Ni pillar («=90.7 WmK “1, taken froma pure Ni value) and 
the eutectic (k=3.25 Wm 'K“, based on the solid value”’)—along with 
an assumption ofa volume fraction of 23% and 77%, respectively, which 
is for the case when the pillar diameter and the edge gap are equal. 
The thermal properties of the -25 nm alumina layer on the surface of 
the Ni pillars were not considered. Since the overlayer would solidify 
beforethe composite layer, the overlayer is assumed to have the average 
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thermal conductivity of AgCI-KCI eutectic solid (k=3.25Wm71K7)*.A 
cooling rate of 10 °C min” was enforced on the bottom surface of the 
aluminium plate (that is, the cooling stage). Newton’s law of cooling 
was applied to all other surfaces, with an air temperature of 300 K and 
a thermal transfer coefficient of 1O Wm7K 7? (ref. 7°). The evolution of 
the shape of the solidification front (approximated by the eutectic 
temperature isocontour at 591.73 K) was calculated using element 
sizes, ensuring a minimum of five elements across each feature. A 
simulation was also performed with a eutectic thermal conductivity 
corresponding to the liquid eutectic (x= 0.45 W mK)”. The result 
was qualitatively similar to that of k=3.25 W mK ‘case, but the tilt of 
the solidification front was more pronounced in the outer region of 
the disk and there was almost no tilt in the inner region. While we find 
that changing thermal conductivity has minimal impact on the shape 
of the solidification front, we note that there are other simplifications 
madein our model. First, the electrodeposited Ni pillars probably havea 
lower thermal conductivity than pure Ni, and second, the alumina layer 
will serve as athermal barrier between the pillars and the eutectic (the 
thermal conductivity of amorphous alumina grown by atomic layer 
deposition is” ~2 W mK‘, similar to the thermal conductivity of the 
eutectic). This alumina layer may reduce thermal coupling between 
the pillars and the solidifying eutectic. For simulations of monolayer 
silica colloidal crystal templates, the substrate was instead assumed 
to bea silicon layer (0.6 mm thick, k=130 W mK’). The bulk layer 
was assumed to be a composite of silica colloidal crystal and eutectic 
(560 nm thick, K=2.8 W m'K*) with no overlayer. 


Phase-field simulations 


Solidification of the AgCI-KCl eutectic along the axis of the pillars was 
simulated using a template-directed eutectic solidification model 
based on the phase-field model developed by Folch and Plapp” and 
as applied in other templated-eutectic cases*””. Model parameters, 
including material properties, were taken from refs.””*. Simulations of 
template-directed solidification were conducted over computational 
domains representing a unit cell of the hexagonal lattice of pillars, with 
dimensions of 780.6 nm and 1,352 nm and an edge gap of 220 nm. Peri- 
odic boundary conditions were assumed at the unit-cell boundaries, 
and no-flux boundary conditions were imposed at the template-eutec- 
tic interfaces through the use of the smoothed boundary method“. The 
initial lamellar spacings were chosen such that their integer multiples 
(1 to 7) would fit in the 1,352 nm domain width (see Fig. 2c-i), and the 
solidification velocities were set such that they would result in these 
lamellar spacings in the absence of a template. These initial lamellae 
act as asolid seed inthe simulations. The grid size of the computational 
domainis set to provide sufficient resolution over the diffuse interface, 
which must in turn be sufficiently smaller than the lamellar spacing, 
to ensure numerical accuracy without unnecessary expenditure of 
computational resources. Thus, the simulations with smaller lamellar 
spacing have higher resolutions, resulting in the sharper appearance 
of interfaces than those with larger spacing in Fig. 2. A linear thermal 
gradient of 10° K m™‘ was applied in the solidification direction. This 
thermal gradient is probably larger than that found in the physical 
system, but it allowed the simulation to reach a steady-state struc- 
ture more quickly without affecting the final morphology. The linear 
thermal gradient forced the solidification front to be perpendicular to 
the axis of the pillars (that is, parallel to the x-y plane; see Fig. 2a, b). 
The solidification of the eutectic was simulated along the axis of the 


pillars (that is, along the z axis; see Fig. 2a, b) until a stable, steady- 
state structure was attained. While this model does not explicitly 
specify the contact angle, it closely approximates the model that 
specifies a 90° contact angle ona flat surface of the template material, 
corresponding to a system with both solid phases having the same 
interfacial energy. This assumption of equal interfacial energy canbe 
justified by examining the experimental images, which show most of 
the interfaces intersecting the pillar surface at approximately 90°. 
Phase-field simulations were only performed for the solidification 
direction parallel to the axis of the pillars because a tilted solidifica- 
tion front requires a much larger computational domain. Phase-field 
simulations were not performed for the monolayer colloidal-crystal 
case because the level of resolution necessary to resolve the evolu- 
tion in the small gaps between spheres made the simulation costly. 


Data availability 


Data that support the findings of this study are available within the 
paper, and from the corresponding author on reasonable request. 


Code availability 


The phase-field simulations were performed using a custom code writ- 
tenin Fortran 90. The code is available from K.T. (kthorn@umich.edu) 
on reasonable request. 
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Extended Data Fig. 1| Directional solidification of AgCI-KCI lamellar 
eutectic and experiment schematic for pillar-template-directed eutectic 
solidification. a, Plot of the average lamellar spacing (A) of AgCI-KCl eutectic 
directionally solidified using the hot-stage setup at set cooling rates (plottedin 
orange) and the tube-furnace setup at set draw rates (plotted in blue). 

b, Experiment schematic for infilling the pillar template with eutectic, 


Infilling and directional solidification 


subsequent directional solidification of the eutectic inthe pillar template, and 
characterization of the resultant microstructures. c, SEM image showing the 
eutectic solidified within and outside the pillar template. The solidification 
direction is out of the image (z axis), as indicated by the red dotted circle. 

See Methods for details. Scale bar, 1 zm. 
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Extended Data Fig. 2 | Nomenclature of the patterns. a—d, Schematic showing 
the number of KCI spokes per unit cell of the pillar template for a, trefoil (three), 
b, quatrefoil (four), c, cinquefoil (five) and d, hexafoil (six) patterns. The KCI 
spokes are coloured blue, pillars black, and the unit cell is marked by red dashed 
lines. 
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Extended Data Fig. 3| No-foil and trefoil patterns. UncolouredSEMimagesof — givenunder eachimage. The solidification direction is out of the image (zaxis), 
the samples, showing the variety of no-foil (a-d) and trefoil (e—j) patterns as indicated by the red dotted circle. Allscale bars, 11m. 
mapped in Fig. 3. Values of edge gap, g, and average lamellar spacing, A, are 
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Extended Data Fig. 4 | Quatrefoil, cinquefoil and hexafoil patterns. g,and average lamellar spacing, A, are given under each image. The 
Uncoloured SEM images of the samples showing the variety of quatrefoil (a, b), solidification direction is out of the image (z axis), as indicated by the red 


cinquefoil (c) and hexafoil (d-f) patterns mapped in Fig. 3. Values of edge gap, dotted circle. Allscale bars, 11m. 
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Extended Data Fig. 5 | Image analysis of local and long-range order inthe 
patterns. a, SEM image of atrefoil pattern. b, Histogram representing a simple 
counting of the number of spokes in each unit cell. The dominant feature inthe 
12 um x17 pm SEM image in ais the three-spoke pattern. We also include inthe 
histogram the number of unit cells that have missing pillars or other defects. 

c, Aunit cell of the phase-field-simulated trefoil pattern. d, Mean unit cell 
obtained by averaging the intensity at each pixel froma false-coloured version 
of the SEMimage ina. The experimental average matches the ideal simulated 
unit cell well. e, Map of deviation between the simulated trefoil pattern and 
each unit cell of the experimental image. The unit cells having the lowest 
deviation are denoted in blue and those with the greatest deviation in red. Cells 
where the template pillars appeared damaged were not included inthe 
analysis. f, Unit cell of the simulated cinquefoil pattern. g, Averaged unit cell 
obtained from the SEM image shown in Extended Data Fig. 4c. h, Unit cell of the 
simulated hexafoil pattern. i, Averaged unit cell obtained from the SEM image 
shown in Extended Data Fig. 4e. All scale bars, 1m. 
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Extended Data Fig. 6 | Eutectic phase boundary alignment and thermal 10 °C min”. The isothermal solidification front at the eutectic temperature 
profile in the bulk of the pillar template. a, SEM image of a cross-section, (591.73 K) is denoted by a thick black line; the eutectic temperature is also 
showing a tilted alignment of eutectic phase boundaries within the pillars. indicated by a black line in the key. The black arrow denotes the direction of 
b, Calculated thermal profile in the bulk of the template during directional solidification. Scale bar, 11m. 


solidification of eutectic-infiltrated Ni pillar template at a set cooling rate of 


a b c d e 
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Extended Data Fig. 7 | Effect of pillar diameter on the emergence of patterns. 
a-e, Phase-field-simulated steady-state patterns fora fixed lattice constant of 
780.6 nm, varying edge gaps, g, and lamellar spacing, A. These results suggest 
the possibility of obtaining new patterns by varying the parameter associated 
with pillar diameter as well as g/A. The solidification direction is out of the 
image (zaxis). The images show the x-y plane cross-section of the simulation 
domain repeated once in each direction. The template pillars are displayed as 


black, AgCl as yellow, and KClas blue. 
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Extended Data Fig. 8 | Phase-field simulations for different shapes of pillars. 
a-c, Avariety of patterns intemplated-eutectic obtained by changing the 
shapes and orientations of the pillars: a, oval; b, circle; c, same asa but rotated 
by 90°. Characteristic features depend on the pillar shape and orientation. The 
solidification direction is out of the image (z axis), as indicated by the red 
dotted circle. The images show the x-y plane cross-section of the simulation 
domain repeated once in each direction. The template pillars are displayed as 
black, AgClas yellow, and KClas blue. 
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Extended Data Fig. 9 | Solidification of AgCI-KCI eutectic ina monolayer 
silica colloidal crystal. a, An oblique-view SEM image of the monolayer silica 
colloidal crystal used as a template. b, Plan-view SEM image of a disordered 
structure observed in AgCI-KCl eutectic solidified (withA=1.7 um) withina 
monolayer silica colloidal crystal of diameter 560 nm. The solidification 
direction in bis into the image (z axis), as indicated by the black crossed circle. 
c, Calculated thermal profile in the bulk of the template during directional 
solidification of the eutectic-infilled silica colloidal crystal template at aset 
cooling rate of 10 °C min”. The isothermal solidification front at the eutectic 
temperature (591.73 K) is denoted by a thick black line; the eutectic 
temperature is also indicated by a black line in the key. The black arrow denotes 
the direction of solidification. d, Cross-sectional-view SEM image, with false 
colouring denoting AgClas yellow, and KCl as blue. All scale bars, 1 1m. 
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The impact of topological defects associated with grain boundaries (GB defects) on 
the electrical, optical, magnetic, mechanical and chemical properties of 
nanocrystalline materials’ is well known. However, elucidating this influence 
experimentally is difficult because grains typically exhibit a large range of sizes, 
shapes and random relative orientations® °. Here we demonstrate that precise control 
of the heteroepitaxy of colloidal polyhedral nanocrystals enables ordered grain 
growth and can thereby produce material samples with uniform GB defects. We 
illustrate our approach witha multigrain nanocrystal comprising a Co,;0, nanocube 
core that carries a Mn,O, shell on each facet. The individual shells are symmetry- 
related interconnected grains®, and the large geometric misfit between adjacent 
tetragonal Mn,O, grains results in tilt boundaries at the sharp edges of the Co,0, 
nanocube core that join via disclinations. We identify four design principles that 
govern the production of these highly ordered multigrain nanostructures. First, the 
shape of the substrate nanocrystal must guide the crystallographic orientation of the 
overgrowth phase’. Second, the size of the substrate must be smaller than the 
characteristic distance between the dislocations. Third, the incompatible symmetry 
between the overgrowth phase and the substrate increases the geometric misfit strain 
between the grains. Fourth, for GB formation under near-equilibrium conditions, the 


surface energy of the shell needs to be balanced by the increasing elastic energy 
through ligand passivation® ®. With these principles, we can produce a range of 
multigrain nanocrystals containing distinct GB defects. 


The geometric misfit strain in core/shell nanocrystals was investigated 
using nanocrystals of a polyhedral Co,O, core and a heteroepitaxial 
Mn,0, shell as a model system. We prepared ~11-nm-side Co,0, nano- 
cubes with {100} facets as substrates for the growth of Mn,0O, (Sup- 
plementary Fig. 1a). The formation of Mn,O, grains on the nanocube 
was carried out by the reaction of MnCl, or Mn(HCOO), in an organic/ 
aqueous (xylene/water) reverse micelle solution in the presence of 
oleylamine, oleic acid and HCI” (Supplementary Methods). X-ray 
diffraction (XRD) data confirm that the deposited phase is tetragonal 
Mn,0, (a= 6 =5.765 A, c= 9.442 A; JCPDS #80-0382) (Supplementary 
Fig. 1b), which has a spinel structure elongated along the c axis owing 
to the Jahn-Teller effect of Mn*’, with an electron configuration of Oe ; 
whereas the Co,O, core has a cubic spinel structure (a=b=c=8.084 A; 


JCPDS #42-1467). As shownin Fig. la, the lattice of Mn,O, {220} coincides 
with Co,O, {400} at a misfit of less than 1%, explaining the consistent 
values in the in-plane interatomic distance (d,,) for both phases 
(Supplementary Figs. 1-3, Supplementary Table 1, Supplementary 
Discussion). 

In Fig. 1a, the high-resolution high-angle annular dark-field scanning 
transmission electron microscope (HAADF-STEM) images and the 
Fourier-filtered images reveal that for each facet of the cubic core, a 
Mn,0, grain grows in (001) directions perpendicular to Co,O, {100}, 
and Mn,0, {110} are parallel to Co,0, {010}. The growth direction of 
the shell is guided by the lattice matching along six {100} surface planes 
of the Co;0, core, which leads to the segmentation of the shell into 
multigrains. Anirregularly shaped core induces an inconsistent lattice 
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Fig. 1| Epitaxially guided growth and gap closing of Mn,O, grains onaCo,0, 
nanocube.a, Left, relative orientation between Mn;O, and Co,0, unit cells. 
HAADF-STEM image (centre) and corresponding Fourier-filtered images (right) 
of the Co,0,/Mn,0, nanocrystal taken along the Co,0, (100) and Mn,O, (110) 
zone axes, Showing the relative orientation between the Co,0, core and Mn30, 
grains. Horizontal and vertical lattices are shown in the purple and pink box, 
respectively. The in-plane (outer) spotsin the FFT pattern are mapped ingreen 
(2.04.A) andthe out-of-plane (inner) spots are mapped in red (2.36 A).b, Gaps 
between adjacent Mn,O, grains resulting in disclinations at the Mn,O, GBs. The 
misorientation angle between neighbouring Mn,O, {112} planes is about 8.4°. 
The disclination line is shown asa black triangle with Frank vector w.c, The GBs 
observed in the HAADF-STEM image and the corresponding lattice spacing 
map. The green and red spots inthe FFT image were used to visualize the lattice 
spacing distribution in the nanocrystal. In the map, the interface between the 
Co,0, nanocrystal and the Mn,O, GBs is shown inthe same green colour as 
Co,0,, giving a lattice spacing at the interface of 2.85 A. (The lattice spacings of 
the Mn;0, {112} planes, which are not affected by the Co;0, core, range from 
2.96Ato3.09A (red)). 


coincidence, which prevents the formation of an ordered grain struc- 
ture’ (Supplementary Fig. 4a-c). 

The tetragonal symmetry of Mn,O, produces a gap with an angle of 
about 8.4° between the {112} planes of adjacent Mn,O, grains along the 
edges of the Co,0, core” (Fig. 1b). Twinned disclinations of Mn,O, {112} 
with respect to Co,O, {110} are thus formed around the boundary to 
close the gap. Accordingly, the fast Fourier transform (FFT) of a TEM 
image with the zone axis along Co;0, [100] shows the extension of 
the square lattice of Co,0, into Mn,O, (Fig. Ic). A similar FFT analysis 
with the zone axis along Co,0, [110] further supports the distortion of 
the Mn,0, lattice structure near the Co,0, nanocube edges (Supple- 
mentary Fig. 4d). Each Mn,0, grain has a truncated square pyramidal 
shape enclosed by {100} and {011} facets (Supplementary Fig. 1c, d). The 
overall morphology of each Co,0,/Mn,0O, multigrain nanocrystal can be 
described as a truncated octahedron consisting of a cubic Co,;0, core 
and Mn,O, grains orthogonally grown from the basal planes of the core. 

The overall growth process can be considered to be a generaliza- 
tion or extension of the Stranski-Krastanov (SK) mode growth of a 
thin film on a two-dimensional (2D) substrate to a finite-sized three- 
dimensional (3D) case. In SK growth, the thin film grows up toa critical 
layer thickness (A,), which is determined by the interplay between 
the strain and the surface energy, followed by 3D island growth®”®. 
From the perspective of extending the SK growth to 3D nanocrystal 
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Fig. 2| Extension of SK growth to 3D polyhedral substrates. a, b, Schematic 
illustration of traditional SK growth without geometric misfit strain (a) and 
extended SK growth with a 3D geometric CL induced by geometric misfit strain 
along the sharp edges of the core (b).c, TEM image of aCo,0,/Mn,0, 
nanocrystal without GBs. d, e, TEM images showing the effects of 
counteranions in manganese (II) precursors on the morphology of Mn3;04 
grains without islands (d) and with islands (e). Red and green lines in the high- 
resolution TEM images indicate the CL and the islands of Mn,O,, respectively. 
f, g, Monitoring of the development of grains and GBs while repeating the 
synthetic procedure using chloride (f) and formate (g) ligands with an11-nm 
core. h, i, STEM, TEM, FFT and Fourier-filtered images of Co,0,/Mn,0, 
nanocrystals synthesized using 11-nm (h) and 30-nm (i) Co;0, nanocubes as 
cores and with GBs longer than one critical thickness. The Fourier-filtered 
images show the induced dislocation and its spacing at the Mn,0, GB. 


substrates, as illustrated in Fig. 2a, b, the mechanism by which the grains 
grow to form the core/shell nanocrystals can be described as follows. 
The shell initially grows via a layer-by-layer mode on the facets of the 
polyhedral nanocrystal core, forming a coherent layer (CL) and accu- 
mulating epitaxial strain (Fig. 2a). When the growth thickness exceeds 
h,, the growth mode changes to island growth for strain relaxation. In 
particular, unless there is a GB that exerts geometric-misfit-induced 
stress (Fig. 1c), the Co,0,/Mn,0, core/shell nanocrystals with minimal 
lattice mismatch exhibit almost no CL characteristics because the strain 
energy is very small (Fig. 2c, Supplementary Fig. 2a). 

The CLs grown onthe facets of the core are differently oriented and 
meet each other at a certain angle (,) around the sharp edges of the 
core, forming tilt boundaries (Fig. 2b). Consequently, a geometric strain 
arises due to the mismatch between the CL lattices, whichis rendered 
by anisotropic strain relaxation of the shell material’. The mismatch 
is particularly pronounced in nanocrystals with anisotropic lattice core 
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Fig. 3 | GB defects in Co,0,/Mn,0, nanocrystals. a, 2D illustration of the 
atomic arrangement of the nanocrystal with both a GB and an island. Black 
rectangles represent the distorted Mn,O, unit cells viewed along the [110] 
direction. The Mn,O, {112} planes rotate around (220). b-d, Computer-vision- 
based image-processing results for diamond lattices consisting of Co,0, {220} 
or Mn,O, {112} planes (top) and histograms (bottom) showing the aspect ratio 
distribution of the diamond lattices in the outlined regions of the nanocrystals 
witha GB and without an island (b), with both a GB and an island (c), and without 
aGB and with an island (d). Colour maps show the distribution of strained 
diamond lattice cells. Inthe maps, the horizontal length (d,; Co,0, [100] 


(for example, cubic lattice) and an anisotropic lattice shell (for example, 
tetragonal lattice). The A, value of the CL observed for 2D epitaxy is 
modified in 3D epitaxy to the value at which the geometric misfit strain, 
in addition to the epitaxial strain, can be accommodated by the shell 
material. For example, the Mn,0, lattice at the GBs can accommodate 
the elastic distortion required to create a continuous disclination with 
alength of three Mn,O, unit cells (-2.9 nm; /h,)”°. Therefore, each Mn,O, 
shell can be divided into two regions: the ‘3D geometric CL’ with GBs 
and the islands grown farther from the CL (Fig. 2b). 

When the GB formation is driven energetically, precise management 
of the kinetic effects during the shell growth is the key to obtaining the 
desired coverage and morphology of the shell grains”. In our synthetic 
method using oleylammonium salts, the acid/base ratio of the solution 
and the concentration of counteranions can be varied (Supplemen- 
tary Fig. 5). The deposition rate—a kinetic parameter—and the surface 
energy of the growing nanocrystals—a thermodynamic parameter—are 
affected by both the acid/base ratio and the counteranion concentra- 
tion. The former is more sensitive to the acid/base ratio that controls the 
supersaturation level, and the latter is responsive to the counteranions 
bound tothe surface. Using this synthetic system, we can limit the shell 
growth in a near-equilibrium thin-film growth regime. 

By changing the counteranions that passivate the surface of the shell, 
the morphology of the grains could be controlled to produce CLs with 
(CoMn-CL+]) or without islands (CoMn-CL) (Fig. 2d, e, Supplementary 
Fig. 5c-e). To compare the stabilizing effects of different types of coun- 
teranions, MnCl, and Mn(HCOO), were tested. With HCOO ions, lateral 
growth (that is, wetting of the core surface) of the grains is promoted 
without the formation of the islands because the low surface energy 
can compensate for the energy expended on the epitaxial strain and 
even the large geometric strain to form the GBs (Fig. 2d). On the other 
hand, when MnCl, is used, Mn,O, islands with high surface energy tend 
to grow on the CLs to relax the strain, and the creation of CLs/GBs is 
inhibited (Fig. 2e). For almost the same elemental composition, the 
Mn,0, shell of CoMn-CL has larger coverage, meaning not only more 
grains and GBs, but also a lower height/edge-length ratio than those 
of CoMn-CL+I. This behaviour of the shell material, which switches the 
growth mode between lateral and vertical, depending on the ligand 
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direction) of the lattice cells is divided by the vertical length (d,;Co,0, [010] 
direction) to distinguish the orientations of the grains. In the histogram, ratios 
greater than1are used to obtain the d,,,,/d,, values; that is, the larger of d,/d, and 
d,/d,. The insets show 3D models illustrating the strain field of the Mn,0, 
grains. Vertical dotted lines represent the aspect ratio (1.16) of unstrained 
Mn,0,.e, XRD patterns and calculated peak profiles of Co,0,/Mn,0, 
nanocrystals with GBs. Individual peak profiles are Mn,O, (112) (red), (020) 
(blue), (013) (orange) and Co,0, (220) (green). The full-width at half-maximum 
of the Mn,O, (112) peak increases considerably from 0.90° for CoMn-CL+Ito 
1.56° for CoMn-CL. 


stabilization capability, minimizes the total free energy while incorpo- 
rating the geometric misfit strain’. To further drive near-equilibrium 
growth, we designed experiments that repeat the shell growth under 
the same supersaturated conditions (Fig. 2f, g, Supplementary Discus- 
sion). Asa result, the shell coverage increases and subsequently the GB 
number also rises, producing nanocrystals with a near-equilibrium 
structure in which the grains forma closed loop. 

Furthermore, we could tailor the structures of the GB defects (for 
example, disclinations and dislocations) either by using different types 
of counteranions that can promote non-equilibrium shell growth or 
by changing the core size (Fig. 2h, i, Supplementary Figs. 5f, 6, Supple- 
mentary Discussion). Because of the stress relief through the formation 
of dislocations at the core/shell interface or in the GBs, the GBs are 
extended along with the periodic/zigzag dislocations. 

The structure of the GB defects is illustrated using the area marked 
by the green square in the HAADF-STEM image of a nanocrystal with 
both GBs and islands in Supplementary Fig. 7a (Fig. 3a). Here, d,, and 
dy. are defined as the interatomic spacings among the M”* cations 
(M=Co, Mn) that are parallel (in-plane) and perpendicular (out-of- 
plane) to the interface, respectively, on the 2D projection of the lattice. 
The d,,,/d,, ratio of 1.03 and the square-like shape of the unit cell at the 
GB in Fig. 3a result from the rotation of Mn,O, {112} planes around the 
Mn,0,(220)axes. The FFTs of the STEM images of all the CoMn-CL and 
CoMn-CL+I samples with GBs taken along the Co,0, [100] zone axis 
consistently show the Bragg peaks of Mn,O, (112) dispersed towards 
Co,0, (220) reflections (Supplementary Fig. 7). 

Strain field mapping was conducted by image-processing the atomic- 
resolution HAADF-STEM image (Fig. 3b-d, Supplementary Fig. 8, Sup- 
plementary Methods). The extent of local distortion is presented in 
terms of the aspect ratio of a diamond lattice consisting of Co,0, {220} 
or Mn,0, {112} planes. The colour map images clearly show that the 
strain field has narrow bands of reduced d,,,,/d,, lattices at the GBs 
around the nanocube edges. The statistical distributions of the aspect 
ratios are extracted from the regions defined by the white dashed out- 
lines, which correspond to the lateral views of two adjacent grains 
sharing a GB. The most frequent d,,,/d;, values for the nanocrystals 
with islands are near 1.16, which is close to the c/(./2 a) value of the 
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regular Mn,O, lattice (also see Supplementary Fig. 9). On the other 
hand, lower values are observed for the nanocrystals without islands, 
which can be attributed to the distribution of tensile and compressive 
strains induced along Mn,0O, [220] and [004], respectively, owing to 
the gap closing. 

The local strain distribution is reflected in the electron diffraction 
patterns and powder XRD data througha reasonably uniform grain con- 
figuration inthe nanocrystals (Fig. 3e, Supplementary Figs. 9, 10). The 
GB areas and gap angles are almost identical for all the GBs, implying 
that the residual stresses associated with the gap closing between two 
grains near each GB are almost the same. At a given residual stress, an 
island appears to have less normal strain per plane thana CL according 
tothe strain field mapping. The Mn,O, (112) XRD peak for the CoMn-CL 
nanocrystals is broader than that for the CoMn-CL+I nanocrystals at a 
similar Mn content, which indicates a higher proportion of nonuniform 
deformation due to the smaller volume or the absence of islands. Such 
uniformity in the GB structure provides a valuable opportunity toinves- 
tigate the properties of strained GB structures, which are otherwise 
hard to achieve. For instance, one can obtain the correlation between 
the microscopic defect structures and the ensemble properties. 

When the gap closes, we observe not only normal strain and the 
accompanying Poisson effect, but also shear strain and rotation of 
unit cells at the GBs due to the high elastic anisotropy of orthotropic 
Mn,0,~. The more accurate strain tensor measurement”, which is of 
the order of 0.5 nm, shows that the normal strain and the rotation are 
the principal deformation components (Fig. 4, Supplementary Fig. 11; 
see Supplementary Methods for the algorithm). In both the nanocrys- 
tals with and without islands, the aspect ratios of the unit cells at the 
GBs are close to unity. Furthermore, the sizes of the unit cells increase 
with increasing distance from the Co,0,/Mn,0, interface, with very 
similar values are for unit cells at the same distance. Interestingly, the 
Mn,0, shell accommodates a large 3D strain (~8% for each axis) per 
GB without producing any dislocations. The value is quite large com- 
pared to that of decahedral Au nanoparticles—another material well 
known to exhibit GBs within nanocrystals“. Moreover, GBs in the form 
of disclinations could also be created in the core/shell nanocrystals 
of other material combinations, such as Fe;0,/Mn,0, Mn;0,/Co3;0,, 
Fe,0,/Co,0, and Pd/Au, via geometric misfit strain (Supplementary 
Figs. 12-15, Supplementary Discussion). 
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rectangles) showing the change in aspect ratio and volume of the unit cells. 
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over the line traces shownind. 


Thin films in 2D SK growth mode often exhibit periodic ripple pat- 
terns along with island formation, which is mainly attributed to the 
epitaxial strain’. Pre-patterned substrates with ordered pit arrays 
and stripes are used for directing the island growth and ordering”. 
In our demonstration of the 3D analogy of the SK growth, each of the 
well-defined polyhedral nanocrystals (that is, the core) acts as a pre- 
patterned substrate. Consequently, a thin film (that is, the shell) grows 
with a strain field patterned along the sharp edges owing to the geo- 
metric misfit strain in addition to the epitaxial strain. This 3D network 
of strains also organizes the grains into a 3D superlattice”’. Given that 
the self-organization of lattice domains in 2D epitaxial thin films has 
been a key element for various physical phenomena, this utilization 
of geometric misfit strains in 3D shell growth could facilitate the engi- 
neering of the properties of nanocrystalline materials for improved 
applications, suchas in mechanics, catalysis and dielectrics” °°. 
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Strain and rotation mapping using real-space peak fitting, geometric 
phase analysis and other Fourier filtering measurements were per- 
formed using custom MATLAB scripts. The raw image data and analysis 
codes are available upon reasonable request. 


Acknowledgements Synthesis and image analysis of the nanocrystal samples were supported 
by the Research Center Program of IBS in Korea (IBS-ROO6-D1 to T.H.; IBS-ROO6-G1 to Y.-E.S. 
and K.K. The theoretical part of this work was also supported by the US Department of Energy, 
Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering 
Division, under contract number DE-ACO2-05-CH11231 within the Physical Chemistry of 
Inorganic Nanostructures Program (KC3103 to A.P.A.). The computational work was supported 
by the Supercomputing Center/Korea Institute of Science and Technology Information with 
supercomputing resources including technical support (KSC-2014-C3-037 to K.K.). Work at the 


Molecular Foundry was supported by the Office of Science, Office of Basic Energy Sciences, 
of the US Department of Energy under contract number DE-ACO2-05CH11231. Experiments at 
PLS-II were supported in part by MSIP and POSTECH. 


Author contributions M.H.O., M.G.C., A.P.A. and T.H. conceived the research. M.H.O. and 
M.G.C. designed and performed the experiments and analysed the results. |.P. and K.K. 
performed the density functional theory calculations and analysis. Y.P.K. and S.M. conducted 
the computer-vision-based image processing of HAADF-STEM images. C.O. conducted the 
strain tensor measurements for the HAADF-STEM micrographs. M.G.K. and B.J. contributed to 
the analysis of X-ray absorption spectroscopy and X-ray photoelectron spectroscopy data, 
respectively. DY.C., J.M-Y., D.K., X.W.G. and Y.-E.S. discussed and commented on the results. J.J. 
and J.H. prepared the samples for the TEM analysis. M.H.O., M.G.C., D.K., A.P.A. and T.H. wrote 
the manuscript. A.P.A. and T.H. supervised the project. All the authors commented on the 
manuscript. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019- 
1899-3. 

Correspondence and requests for materials should be addressed to A.P.A. or T.H. 

Peer review information Nature thanks Laura Bocher, Yong Ding and the other, anonymous, 
reviewer(s) for their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


Importance and vulnerability of the world’s 
water towers 


https://doi.org/10.1038/s41586-019-1822-y W. W. Immerzeel'?®, A. F. Lutz'??&*, M. Andrade*“, A. Bahl®, H. Biemans*, T. Bolch’, S. Hyde®, 
S. Brumby’, B. J. Davies®, A. C. Elmore’, A. Emmer?®, M. Feng”, A. Fernandez", U. Haritashya”, 
J. S. Kargel®, M. Koppes™, P. D. A. Kraaijenbrink’, A. V. Kulkarni”, P. A. Mayewski’’, S. Nepal”, 
P. Pacheco”, T. H. Painter’, F. Pellicciotti”°, H. Rajaram”, S. Rupper”2, A. Sinisalo”, 


A.B. Shrestha”, D. Viviroli?*, Y. Wada“, C. Xiao”®, T. Yao"? & J. E. M. Baillie® 


Received: 27 May 2019 


Accepted: 11 November 2019 


Published online: 9 December 2019 


Mountains are the water towers of the world, supplying a substantial part of both 
natural and anthropogenic water demands’”. They are highly sensitive and prone to 
climate change”, yet their importance and vulnerability have not been quantified at 
the global scale. Here we present a global water tower index (WTI), which ranks all 
water towers in terms of their water-supplying role and the downstream dependence 
of ecosystems and society. For each water tower, we assess its vulnerability related to 
water stress, governance, hydropolitical tension and future climatic and socio- 
economic changes. We conclude that the most important (highest WTI) water towers 
are also among the most vulnerable, and that climatic and socio-economic changes 


will affect them profoundly. This could negatively impact 1.9 billion people living in 
(0.3 billion) or directly downstream of (1.6 billion) mountainous areas. Immediate 
action is required to safeguard the future of the world’s most important and 
vulnerable water towers. 


The term ‘water tower is used to describe the water storage and supply 
that mountain ranges provide to sustain environmental and human 
water demands downstream!”. Compared to its downstream area, a 
water tower (seasonally) generates higher runoff from rain as a result 
of orographic precipitation and delays the release of water by storing 
itin snowand glaciers (because of lower temperatures at high altitude) 
and lake reserves. Because of their buffering capacity, for instance by 
supplying glacier melt water during the hot and dry season, water tow- 
ers provide a relatively constant water supply to downstream areas. We 
define a water tower unit (WTU; see Methods, Extended Data Fig. 1) as 
the intersection between major river basins and a topographic moun- 
tain classification based on elevation and surface roughness*. Since 
water supply and demand are linked at the river basin scale, the basin 
is the basis for the WTU. One WTU can therefore contain multiple topo- 
graphically different mountain ranges and we assume that it provides 
water to the areas in the downstream river basin that are hydrologically 
connected to the WTU (Extended Data Fig. 1, Extended Data Table 1 and 
2). Subsequently, we consider only cryospheric WTUs by imposing 
thresholds on satellite-derived snow-cover data’ and a glacier inven- 
tory’, because the buffering role of glaciers and snow and the delayed 


supply of melt water is a defining feature of water towers. Consequently, 
there are regions (for example, in Africa), which do contain mountain 
ranges, but because of their small snow and ice reserves they do not 
meet the WTU criteria. In total, we define 78 WTUs globally (see Meth- 
ods), whichare hometo more than 250 million people. However, more 
than 1.6 billion people live in areas receiving water from WTUs, which 
is about 22% of the global population’ (Fig. 1). 

Water towers have an essential role in the Earth system and are par- 
ticularly important in the global water cycle’”. In addition to their water 
supply role, they provide a range of other services". About 50% of 
the global biodiversity hotspots on the planet are located in mountain 
regions”, they containa third of the entire terrestrial species diversity”, 
and are extraordinarily rich in plant diversity"*. Moreover, mountain 
ecosystems provide key resources for human livelihoods, host impor- 
tant cultural and religious sites, and attract millions of tourists glob- 
ally’. Economically, 4% and 18% of the global gross domestic product 
(GDP) is generated in WTUs and WTU-dependent basins respectively”. 
Furthermore, mountains are highly sensitive to climate change** and 
are warming faster than low-lying areas owing to elevation-dependent 
warming”. Climate change therefore threatens the entire mountain 
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Fig.1| The WTI, the population in WTUs and their downstream basins. The 
WTI, derived from the Sl and the DI, is shown for all 78 WTUs, in combination 
with the shaded total population in all WTU-dependent river basins. Labels 


ecosystem. Worldwide, the vast majority of glaciers are losing mass”, 
snow melt dynamics are being perturbed’*”, and precipitation and 
evapotranspiration patterns are shifting, all leading to future changes 
inthe timing and magnitude of mountain water availability”. Besides, 
the combination of cryosphere degradation and increases in climate 
extremes implies changing sediment loads affecting the quality of 
water supplied by mountains”. 

Not only are the world’s water towers crucial to human and 
ecosystem survival, the steep terrain in combination with extreme 
climatic conditions, and in some regions seismic or volcanic activ- 
ity, frequently triggers landslides, rock fall, debris flows, avalanches, 
glacier hazards and floods™”>. Since 2000, over 200,000 people have 
died in WTUsasa result of natural disasters”. Climate change, incom- 
bination with population growth, urbanization and economic and 
infrastructural developments, is likely to exacerbate the impact of 
natural hazards and further increase the vulnerability of these water 
towers”??7°°, 


Quantifying importance of water towers 


Consequently, there is astrong need for a consistent framework within 
which to assess and rank the importance and vulnerability of individual 
WTUs in order to guide global research, as well as conservation and 
policy-making efforts. Here we develop such a framework according 
to quantifiable indicators for both the water supply and demand sides 
of each WTU. Conceptually, a WTU is deemed to be important when 
its water resources (liquid or frozen) are plentiful relative to its down- 
stream water availability and when its basin water demand is high and 
cannot be met by downstream water availability alone. Ideally, suchan 
assessment would require a global-scale, high-resolution, fully coupled 
atmospheric-cryospheric-hydrological model that can resolve the 
interactions between extreme topography and the atmosphere, fully 
account for snow and ice dynamics, and incorporate anthropogenic 


100 
500 


indicate the five water towers with the highest WTI value per continent. The 
insets show the number of people living in WTUs as a function of elevation and 
of the downstream population’s proximity to the WTUs’. 


interventions in the hydrological cycle. It would also require models 
that include socio-economic impacts on sectoral water demands and 
a spatially explicit attribution of water sources (for example, meltwa- 
ter, groundwater, surface runoff) to water use. Although excellent 
progress has been made in specific regions and for specific sectors”, 
at the global scale this is not yet feasible. We therefore derive indices 
covering relevant drivers for both the water supply and demand of 
aWTU’s water budget (see Methods), which we combine to derive a 
water tower index (WTI). 

The supply index (SI) is based on the average of four indicators that 
are quantified for each WTU: precipitation, snow cover, glaciers and 
surface water (Fig. 2a, Extended Data Table 3, Supplementary Table 1 
and Methods). If the precipitation in the WTU (Extended Data Fig. 3a) 
is high relative to the overall basin precipitation and if the inter-annual 
and intra-annual variation is low (that is, the supply is constant),a WTU 
scores highly on the precipitation indicator. If a WTU has persistent 
snow cover (Extended Data Fig. 3b) throughout the year and the snow- 
pack shows lower inter-annual variation, this will result in a high snow 
indicator. Similarly, if the total glacier ice volume (Extended Data 
Fig. 4a) and glacier water yield ina WTU are high relative to the basin 
precipitation then a WTU has a high glacier indicator value. Finally, 
we assess the amount of water stored in lakes and reservoirs ina WTU 
(Extended Data Fig. 4b) compared to basin precipitation to derive a 
surface water indicator. 

There is considerable variability in the power of WTUs to supply 
water. In Asia, the Tibetan Plateau has the highest ranking because of 
the large amounts of water stored in lakes, but a large part of the Tibetan 
Plateau is endorheic and its water resources are disconnected from the 
downstream demand. The Indus WTU has an important water-supply- 
ing role witha balanced mix of precipitation, glaciers, snow and surface 
water. In Europe, the Arctic Ocean islands, Iceland and Scandinavia have 
extensive stocks of water stored in their WTUs. Iceland stands out with 
some of the thickest glaciers in the world anda glacier ice storage (about 
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Fig. 2 |The Sl and DI.a, b, The SI (a) and the DI (b) of each WTU grouped by 
continent and ordered by SI or DI value, respectively. The stacked bars show the 
four indicator values for surface water (L), glacier (G), snow (S) and 
precipitation (P).Inb, the stacked bars show the four indicator values for 


1,027 km’) that is 15 times as large as its total annual WTU precipitation 
(about 67 km?). In South America, the mountain ranges (Extended 
Data Tables 1, 2) supplying the Southern Chilean Pacific coast regions 
and La Puna Region are the most prominent water towers, because of 
large glacier ice reserves and high orographic precipitation rates and 
because of the large amount of water stored in lakes (in the La Puna 
region). The Northwest Territories and Nunavut, Fraser and the Pacific 
and Arctic coast are the key WTUs in North America. Inthe Northwest 
Territories and Nunavut the relevance of the WTU is primarily driven 
by the abundance of glaciers, snow and surface water. However, the 
precipitation indicator value is low, meaning that mountain precipita- 
tion is low relative to the overall basin precipitation. 

To derive a demand index (DI) for each WTU, we quantify the monthly 
water requirements to be supplied by the water towers to sustain the 
WTUbasin’s net sectoral water demand for irrigation, industrial (energy 
and manufacturing) and domestic purposes, and monthly natural water 
demand, relative to the total annual demand (Fig. 2b, Extended Data 
Table 4, Supplementary Table 1). Monthly sectoral water requirements 
are estimated by subtracting the monthly water availability down- 
stream (ERAS precipitation minus natural evapotranspiration™) from 
the monthly net demands”. The Dl is the average of the four indicators 
(see Methods). Figure 2b demonstrates considerable variability, glob- 
ally and within continents, inthe demands that WTUs need to sustain. 
Irrigation water demands are the highest of the four demand types, 
and this is relatively consistent across the continents. The Asian river 
basins, specifically the heavily irrigated and densely populated basins 
suchas the Indus, Amu Darya, Tigris, Ganges-Brahmaputra and Tarim, 
score more highly on the DI than other basins across the world and 
they score highly on each sectoral demand indicator. In those basins, 
the water required to close the gap between demand and downstream 
supply may also originate from (unsustainable) groundwater use*”>. 
However, in those cases, when there is a large water gap being (partly) 
closed by unsustainable groundwater pumping, the WTU water sup- 
ply is critical both to meet the demand and to recharge the aquifers. 
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natural (Dyaz), industrial (D,yp), domestic (Dpoy) and irrigation demands (Dp). 
Calculation details of the indicators and indices are provided in Extended Data 
Tables 3, 4. 


In Europe, the Volga and Ural in Russia show the highest DI values, 
including high values for the natural demand indicator, whereas the 
Negro basin has the highest DI in South America. In North America a 
range of basins scores equally highly, but for different reasons. For 
example, the Mississippi—Missouri basin scores highly particularly 
because of a high natural demand indicator value, whereas the Cali- 
fornia basin scores highly on all four demand indicators. 

Ultimately, the presence of mountain water resources, either as addi- 

tional rain or stored in snow, ice or lakes, in conjunction with a high 
demand downstream, determines whether a WTU has an indispensable 
role (Extended Data Fig. 2). The WTI is the product of the SI and the 
DI, for which the values are subsequently normalized over the range 
of WTI values found for all 78 WTUs (Fig. 1, Supplementary Table 1). 
Globally, the upper Indus basin is the most critical water tower unit 
(WTI=1.00 + 0.03) with abundant water resources in the Karakoram, 
Hindu-Kush, Ladakh and Himalayan mountain ranges in combina- 
tion with a densely populated and intensively irrigated downstream 
basin”**°. In North America, the Fraser and Columbia river basins are 
the most critical WTUs (WTI=0.62+ 0.07 and 0.58 + 0.06, respectively). 
The Fraser basin is rich in surface water resources, and has a high natu- 
ral water demand downstream, whereas the Columbia basin is rich 
in snow and glacier resources in combination with a high irrigation 
demand. In South America, the Cordillera Principal, the Cordillera 
Patagonica Sur and the Patagonian Andes are key WTUs in the supply 
of water to the South Atlantic and Pacific coastal regions and the Negro 
basin. In Europe, the Alps are the most relevant water-supplying moun- 
tain range, meeting the demands of the Rhéne (WTI = 0.45 + 0.07), Po 
(WTI=0.39 + 0.07) and Rhine (WTI = 0.32 + 0.11) basins. We note that 
several WTUs that score highly on either the SI or the DI do not rank 
highly in the final WTI. For example, the Tibetan Plateau and Arctic 
Oceanislands WTUs score highly on the SI, but have the lowest scores on 
the DI, owing to low water demands (Fig. 2b). By contrast, the Sabarmati 
in Asia with asmall portion of its water coming from the Himalayas has 
the highest DI, but alow SI. 
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Fig. 3| The vulnerability and projected change of the top five WTUs of each 
continent. The total vulnerability (indicated by larger polygons), and 
projected change indicators of the five most important WTUs oneach 
continent. BWS is the baseline water stress indicator of the basin*®; GE is an 
indicator for government effectiveness in the basin”; HT is hydro-political 
tension®’; dGDP* and dPop’ are the projected changes in gross domestic 
product and population between 2000 and 2050, according to Shared 


Vulnerability of the water towers 


We assess the vulnerability of each WTU and show this for the 
five most important (that is, with highest WTI values) WTUs in Asia 
and Oceania, Europe, North America and South America (Fig. 3, Sup- 
plementary Table 2). For this analysis, we include the hydro-political 
tension”, baseline water stress*®, government effectiveness”, pro- 
jected climate change”, projected change in GDP”, and projected 
population change’ (see Methods). The highest-ranking WTUs of South 
Americaand Asia in particular are more vulnerable than those in North 
America and Europe. Strikingly, the Indus, which is globally the most 
important water tower (Fig. 4), is also very vulnerable. The Indus is a 
transboundary basin with considerable hydro-political tension between 
its riparian countries Pakistan, India, China and Afghanistan. The popu- 
lation of approximately 235 million people in the basin in 2016 is pro- 
jected to increase by 50% by 2050, and the basin’s GDP is projected to 
encounter a nearly eightfold increase”. The average annual tempera- 
tureinthe Indus WTUis projected to increase by 1.9 °C between 2000 
and 2050, compared to 1.8 °C in the downstream section*®. The aver- 
age annual precipitation in the Indus WTU is projected to increase by 
0.2%, compared to 1.4% downstream”. It is evident that, owing to the 
expected strong growth in population and economic development, 
the demand for fresh water will rise exponentially”. Combined with 
increased climate change pressure on the Indus headwaters, an already 
high baseline water stress and limited government effectiveness, it 
is uncertain whether the basin can fulfil its water tower role within 
its environmental boundaries. It is unlikely that the Indus WTU can 
sustain this pressure. 


» 


South Argentina, Rhine Tarim interior 


South Atlantic coast 


Socioeconomic Pathway 2 (SSP2); dP*° and dT*’ are the projected 
precipitation and temperature changes between 2000 and 2050 according to 
the CMIP5 multi-model ensemble mean for Representative Concentration 
Pathway (RCP) 4.5*°. WTUs are ranked by vulnerability (highest vulnerability on 
top); colour filling indicates the WTU’s WTI value. See Methods for calculation 
details. 


The Indus does not stand alone, however. Nearly all important WTUs 
in Asia are also highly vulnerable (Fig. 3). Most WTUs are transbound- 
ary, densely populated, heavily irrigated basins and their vulnerability 
is primarily driven by high population and economic growth rates 
and, in most cases, ineffective governance. Moreover, the Syr Darya, 
Amu Daryaand Indus, in particular, are characterized by considerable 
hydro-political tension’. In most cases, downstream riparian states 
are dependent on mountain water resources provided by bordering 
upstream states to supply the competing irrigation, hydropower and 
domestic demands. In South America, the vulnerability is less than 
for the Asian WTUs, and the drivers are variable. On northern Chile’s 
Pacific coast, the baseline water stress and a projected decrease in 
precipitation (—4.8%) cause the vulnerability, whereas population 
and economic growth render the La Puna region’s WTU vulnerable. In 
North America, the vulnerabilities are related to population growth 
and temperature increase. 


Global assets with increasing importance 

Planetary boundaries (for example, the CO, concentration, global 
freshwater use and biosphere integrity) are defined as thresholds within 
which humanity can safely function without abrupt large-scale changes 
to the environment®. Climate change and biosphere integrity have 
been identified as the core planetary boundaries with the potential 
to change the state of the Earth system should they be consistently 
transgressed for a prolonged period of time“. The global food system, 
in particular, has been identified as a major pressure on the planetary 
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Fig. 4| WTI and vulnerabilities of the Indus basin. a, The supply and demand 
indicators. b, The vulnerabilities. See Methods for details on the supply and 
demand indicators and the meaning of the vulnerability ranges. S,, snow cover; 
Sy, intra-annual snow cover variability; S),, inter-annual snow cover variability; 
S,snow indicator; S,, lake and reservoir volume; L, surface water indicator; Gy, 
glacier ice volume; Pg, 4c— B, glacier water yield; G, glacier indicator; Pyry, WTU 


boundaries*. Without targeted technological changes and mitigation 
measures, it is expected that the adverse environmental effects of the 
food system could increase by more than 50% by 2050 relative to 2010, 
thus crossing the planetary boundaries®. In relation to the planetary 
boundaries, water towers are of particular importance. They are highly 
vulnerable to climate change, a key water supply that sustains the major 
global food systems in the world and rich in biodiversity. 

Aclear implication is that vulnerability can be decreased with conser- 
vation, or increased with inefficient water use. This may seem logical 
and obvious, but it also means that the priorities for the most urgent 
action can be shifted as the nations of WTUs practice conservation or 
grow in an unsustainable way. Although irreversible changes in the 
buffering capacity of water towers are underway, conservation of the 
water towers in the broadest sense starts with the global task to mitigate 
further global climate warming leading to cryosphere degradation 
and its adverse effects on the water towers’ buffering role. Ina more 
local or regional context, water conservation is the one part of the 
equation that is under the control of an individual nation’s part of a 
water tower system, calling for transboundary cooperation. Specific 
conservation can, for example, imply preserving the buffering capacity 
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precipitation; P,,;, basin precipitation; Pyy, WTU intra-annual precipitation 
variability; Pyy, WTUinter-annual precipitation variability; P, precipitation 
indicator; Diypy, net industrial demand; D,yp, industrial demand indicator; Dyary, 
natural demand; Dyq;, natural demand indicator; Dpomy, net domestic demand; 
Dpom domestic demand indicator; Dipgy, net irrigation demand; Dp, irrigation 
demand indicator. 


of mountain ranges in newly established protected areas, increasing 
the buffering capacity with reservoirs, and conservation of water by 
increasing water-use efficiency. Efficient use of scarce water resources 
can translate into improved wellbeing of people and increased eco- 
nomic and food security. 

The vulnerability of these water towers in the future is controlled 
bythe trajectory of change that aWTU andits associated downstream 
basin will follow. At the global scale we made a first-order assessment 
for amiddle-of-the-road scenario bothin terms of climate change and 
of socio-economic pathway (see Methods). However, it is important 
to acknowledge that the future pathways are extremely precarious 
and the outcomes diverging and uncertain. A recent assessment for 
the Hindu-Kush Himalayan region concluded that there is no single 
likely future: the region may run downhill, may do business as usual 
or it may advance to prosperity. Each of those future pathways will 
result in systematically different demands for water and may cross the 
planetary boundaries in varying degrees and this will probably hold for 
most WTUs, but those in Asia and South America in particular. 

Mountains are also an essential resource in the context of the United 
Nations’ Sustainable Development Goals (SDGs) that have been 


targeted towards the year 2030”. Mountains play a key part in achiev- 
ing the SDGs for water (SDG 6), food (SDG 2) and energy (SDG 7). Given 
the projected change in climate and socioeconomic development in 
mountain-dependent basins, it is evident that if the SDGs are to be 
achieved the water resources of the water towers need to be harnessed 
within safe environmental limits. 

We therefore make three essential recommendations. First, moun- 
tain regions must be recognized as a global asset of the Earth system. 
Second, it must be acknowledged that vulnerability of the world’s water 
towers is driven both by socio-economic factors and climate change. 
Third, we must develop international, mountain-specific conservation 
and climate-change adaptation policies (suchas national parks, pollut- 
ants control, emission reductions, erosion control and dam regulations) 
that safeguard the mountain ecosystems and mountain people and 
simultaneously ensure water, food and energy security of the millions 
of people downstream. 
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Methods 


Delineation of WTUs 

Inthis study, we define a WTUas the intersection of major river basins? 
and a topographic mountain classification based on elevation and 
surface roughness developed inthe framework of the Global Mountain 
Biodiversity Assessment (GMBA)*. Although other similar mountain 
classification datasets exist’ that are also based on a combination of 
elevation and surface roughness, we use the GMBA classification (ver- 
sion 1.2) because topographical names of mountain ranges have been 
assigned to each of the mountain regions classified. The original GMBA 
inventory contains 1,048 mountain regions worldwide. We make a sub- 
set of this dataset by imposing minimum thresholds for glacier area, 
glacier ice volume and snow persistence. We retain those mountain 
regions which have an ice volume larger than 0.1 km? (ref. *’) or an aver- 
age annualareal snow persistence larger than 10%’. After imposing these 
thresholds, 174 mountain regions remain. We intersect those regions 
with the major river basins and dissolve the result based on major river 
basin ID; that is, all selected GMBA regions within a basin are grouped 
as asingle WTU (Extended Data Fig. 1, Extended Data Table 1, Extended 
Data Table 2). The final WTU delineation contains 78 units (Extended 
Data Fig. 1). For each WTU we also define the downstream area that 
directly depends onthe WTU using the river sub-basin delineation’, and 
we specify which mountain ranges are part of the WTU (Extended Data 
Fig. 1, Extended Data Table 1, Extended Data Table 2). This dependent 
downstream area is smaller than the total downstream basin because 
not every downstream sub-basin is hydrologically connected to the 
WTU. To this end we start at the WTU and iteratively select each con- 
nected downstream sub-basin until the basin outlet, or lowest sub-basin 
in case of an endorheic system, is reached (Extended Data Fig. 1). 


Quantifying the WTI 
We combine an Sl anda Dlintoa WTI with which to rank WTUs. All grid 
calculations are performed at 0.05° resolution. 

The SI (see Extended Data Table 3 for all equations) is based on indi- 
cators for precipitation, snow cover, glaciers and surface water stor- 
age. For the precipitation indicator, the 2019 released ERAS reanalysis 
dataset is used. As sub-indicators, we first compute the total annual 
average (2001-2017) WTU precipitation (Extended Data Fig. 3a) relative 
to the overall basin precipitation (P,;). We then include the inter-annual 
variation in WTU precipitation (P,,) and the intra-annual monthly WTU 
variation (Pyy) based on the 2001-2017 time series. We combine these 
three sub-indicators into a precipitation indicator (P), giving the varia- 
tion (Py, and P,,y) the same weight as P,. The underlying assumption of 
including the variation is that ifthe variation is low, the WTU will provide 
a constant flow of water to the downstream basin, and therefore it isa 
more important WTU. For the snow cover indicator, we use the MODIS 
MOD10CMI1 product’. We derive an average annual snow cover (S,) in 
each WTU for the 2001-2017 period (Extended Data Fig. 3b). Here too, 
we derive both an inter-annual (Syy) and intra-annual (Syy) variation in 
snow cover, and using the same rationale as for the precipitation indi- 
cator, we combine the average snow persistence with the variation to 
derive a final snow indicator (S). For the glacier indicator, we compute 
the glacier ice volume ina WTU*’ (Extended Data Fig. 4a) relative to the 
average annual WTU precipitation (G,). We also compute the annual 
glacier water flux relative to the WTU precipitation on non-glacierized 
terrain (G,). We estimate the glacier water yield as the sum of the on- 
glacier precipitation and the mass balance per WTU. The WTU mass bal- 
ance is based onthe area-weighted average annual mass balance from 
all geodetic and direct mass balance measurements made available by 
the World Glacier Monitoring Service’. However, if there are fewer than 
ten glaciers with data available within a WTU then we use the regional 
average”. We average G, and G, to derive a final glacier indicator (G). 
For the surface water indicator (L), we compute the total volume of 
water that is stored in lakes and reservoirs ina WTU™ (Extended Data 


Fig. 4b) relative to the average annual WTU precipitation. The Slis the 
average of P,S,GandL. 

The Dlis based on net human water demands for domestic, industrial 
and irrigation purposes”’, and natural demand (see Extended Data 
Table 4 for all equations, Extended Data Fig. 5, Extended Data Fig. 6). 
Since data for the natural demand, defined as the minimum river flow 
required to sustain the ecosystem, are not readily available, we estimate 
it with the environmental flow requirement computed with the 90th- 
percentile exceedance value of the natural flow?! ”. First, the average 
monthly sectoral demands are computed based on a 2001-2014 time 
series (Dpomm Pirrm Dino Dyatm)- Part of each sectoral demand can 
potentially be met by downstream water availability that does not have 
its origin in the mountains. For each grid cell with a positive demand 
we therefore compute the average monthly water availability (WApomm: 
WAnrrm WAnp,m» WAnatm’ See Extended Data Table 4) as the precipitation 
minus the actual natural evapotranspiration®. We subtract this amount 
from the average monthly sectoral water demands as an estimate for 
the monthly demand that needs to be met by other sources, including 
the WTUs. We assume that the entire water deficit has to be provided 
by the WTU, although other water sources, suchas groundwater”, can 
also be important. We acknowledge that the global scale of our assess- 
ment also prevents us from fully taking into account the distribution 
and allocation of water within different portions of our spatial units 
of calculation. Finally, we aggregate these monthly net demands to 
be sustained by the WTU over all months and we divide it by the total 
annual sectoral demand to get four demand indicators (Dpom, Dinp, Dire, 
Dyat). The Dl is the average of the indicators Dpow, Dinp, Digg and Dwar 

The final WTI is the product of SI and DI, for which the values are 
subsequently normalized over the range of WTI values found for all 
78 WTUs. By using a multiplicative approach, we ensure that a WTU 
only ranks highly when it has considerable water resources (either as 
precipitation, glacier ice, snow and surface water or a combination) 
inthe mountains, and the demand for those resources downstream is 
likewise high (Extended Data Fig. 2). 


Uncertainty 
Itis acknowledged that the SI, Dland WTI are based on partly arbitrary 
choices of indicators and sub-indicators. In our assessment we have 
assigned an equal weight to each of the indicators constituting SI and 
DI. To account for uncertainty in the weight of each indicator in the 
WTI calculation we have performed a sensitivity analysis in which we 
randomly vary the weights of each of the eight indicators that constitute 
the Sand Dl and assess the impact onthe WTI ranking of the WTUs. We 
assume that the weight of each indicator is uniformly distributed and 
can bea maximum of three times as high or low as another indicator, 
and we assess through a 10,000-member Monte Carlo analysis how 
sensitive the rank of the WTUis asa result of this uncertainty (Extended 
Data Fig. 7). The analysis shows that the top and bottom of the ranking 
are robust and only limited shifts in the ranking occur (<5 positions). 
However, the middle part of the ranking is more sensitive to the weights 
of the indicators and there is a considerable number of WTUs where, in 
more than 25% of the total runs, the rank changes more than 5S positions. 
In addition, we also include a1,000-member Monte Carlo analysis to 
assess the propagation of uncertainty in the datasets used in the WTI 
calculation. For each input dataset we estimate a standard deviation 
and assuming a normally distributed error we sample from the distri- 
bution to assess how the input data uncertainty affects the WTI value 
(Supplementary Table 1) and WTU ranking (Extended Data Fig. 7). For 
precipitation we compute the standard deviation per WTU and per 
downstream basin based on nine different precipitation datasets (CRU 
bias-corrected with ERA-Interim, CRU TS2.1 downscaled with ERA-40, 
CRU TS3.21 downscaled with ERA-40, CRU TS3.21 downscaled with 
ERA-Interim, WFDEI, NCEP-NCAR Reanalysis, WATCH, WATCH cor- 
rected with GPCC, and ERAS)? *’. For evapotranspiration we take a 
similar approach using four different datasets (ERA-Interim, GLEAM, 


MERRA-2, PCR-GLOBWB forced with ERA-Interim, and ERAS)***°° ©, 
Values for snow persistence, ice volumes, glacier mass balance, and the 
domestic, industrial and irrigation water demands are derived from 
the literature!”4*“ ©, For the uncertainty in lake and reservoir volume 
we assumea standard deviation of 10% and we keep the environmental 
flow requirement constant. The ranking is also sensitive to input data 
uncertainty; however, the ranking is robust, in particular in the top 
20 places of the ranking where only limited shifts in positions occur. 
Here, too, most shifts are observed in the middle part of the ranking. 


Assessing vulnerabilities 

For the WTUs, we assess the vulnerability of their role as water tower 
based on three static indicators for water stress, government effec- 
tiveness and the potential for hydro-political tension in case of trans- 
boundary basins (Supplementary Table 2). In addition, we include four 
change indicators: the projected change in temperature, precipitation, 
population and gross domestic product between 2000 and 2050. In 
all cases we use the ensemble mean RCP4.5 climate change scenario 
in combination with the SSP2 shared socio-economic pathway” 
as a middle-of-the-road scenario, both in terms of economic develop- 
ment and associated climate change (Supplementary Table 2). We 
scale the different vulnerability indicators between 0 (minimum vul- 
nerability) and 1 (maximum vulnerability) considering the thresholds 
defined below. 

For water stress, we use the baseline water stress (BWS) indicator’®. 
BWS measures the ratio of total water withdrawals to the available 
renewable surface and groundwater supplies; higher values indicate 
more competition among users. The index value is derived from an ordi- 
nary least-squares regression fitted through raw monthly water-stress 
values for 1960-2014, taking the fitted BWS value for 2014°**. We com- 
pute the area-averaged BWS for all WTUs, including their downstream 
dependent areas and scale between 0 and 5, which is the range of the 
BWS scale in ref. °°. High BWS is associated with high vulnerability and 
low BWSis associated with low vulnerability. Since no global dataset for 
water management capacity is available at the global scale we validated 
the indicators gross domestic product (GDP)*, human development 
index (HDI) and government effectiveness (GE)”’ as proxies for water 
management capacity, which is available for selected mountainous 
basins only. GE shows the best correlation with water management 
capacity in the selection of basins, and we calculate the area-averaged 
value for each WTU including its downstream dependent area. We 
scale between —1.5 and 2.0, which are the minimum and maximum 
values found for the WTUs. A low value for GE implies high vulnerability 
whereas a high value for GEindicates low vulnerability. Lastly, all trans- 
boundary basins are assessed on the risk for potential hydro-political 
tensions based ona global mapping of basins that are ill-equipped to 
deal with transboundary disputes triggered by the construction of new 
dams and diversions”. We compute the WTU basin aggregated score 
provided by the cited study and the range of the original scale in the 
cited study (0 to 5) is used to scale between minimum and maximum. 

For each WTU we compute a projected multi-model ensemble mean 
change in precipitation (measured as a percentage) and temperature 
(measured in kelvin) between 2000 and 2050 for RCP4.5 for 35 differ- 
ent CMIP5 climate models*°. For projected changes in temperature 
the scores for the individual WTUs are linearly scaled between O and1 
for the full range of projected temperature increases of all WTUs. For 
precipitation projections, only decreases in precipitation are assumed 
to contribute to vulnerability (that is, projections of increases in pre- 
cipitation and unchanged precipitation are classified as minimum 
vulnerability). The scores for the individual WTUs are scaled linearly 
between 0 and 1, where O indicates unchanged or increasing precipita- 
tionand Lindicates the largest precipitation decrease projected for all 
78 WTUs. The projected population change between 2016 and 2050 
for SSP2 is derived from the HYDE database’ and the relative increase 
for each of the WTU basins is computed. All WTUs are scaled between 


a growth of 0% and a maximum of 50%, that is, if the projected popu- 
lation growth is more than 50%, a WTU has maximum vulnerability. 
The relative increase in GDP between 2000 and 2050 is computed 
per WTU basin, with the assumption that a strong projected increase 
in GDP is indicative of a strong growth in water demand. Data for the 
SSP2 shared socio-economic pathway are used*". All WTU basins are 
scaled between the minimum and the maximum, which is capped by 
a growth rate of 1000%. 

We assess indicators of various nature for vulnerability and future 
changes. To assess a complete vulnerability based on this set of indica- 
tors is challenging and requires knowledge of the weights of the indi- 
vidual indicators in assessing the total vulnerability for each WTU. The 
caveat is made that we consider a middle-of-the-road scenario bothin 
terms of projected climate change and socio-economic development 
as a first-order assessment. The future development pathway in most 
WTUs, in particular in Asia and South America, is uncertain and highly 
diverging and depends on the global economy, regional growth rates 
and geopolitical tensions, which are difficult to project or quantify. 
In addition, a satisfactory representation of mountainous climate in 
General Circulation Models is difficult, leading to large uncertainty in 
particular for future precipitation projections. 

In our study we assess impacts-driven vulnerability, where vulner- 
ability is defined in direct proportion to the magnitude of hydrological 
change. However, we note that recent work on the human dimensions 
of climate change have demonstrated that vulnerability emerges from 
the interaction of both environmental and social dynamics in specific 
contexts®”°, 


Data availability 


The data generated to support the findings of this study are available 
in an online data repository at zenodo.org at https://doi.org/10.5281/ 
zenodo.3521933. Third party data used in this study are available as 
follows. Hydrological basin boundaries’ used in this study are available 
online at http://www.fao.org/nr/water/aquamaps/. Mountain defini- 
tion data*® used in this study are available online at https://ilias.unibe. 
ch/goto_ilias3_unibe file 1047348.html. Precipitation and evaporation 
data used in this study” are available online at https://cds.climate. 
copernicus.eu. Snow cover data used in this study’ are available online 
at https://nsidc.org/data/MOD10CM. Glacier volume data* used in this 
study are available online at https://doi.org/10.3929/ethz-b-000315707. 
Glacier mass balance data” are available online at https://wgms.ch/. 
Lake and reservoir storage data® used in this study are available online 
at https://www.hydrosheds.org/pages/hydrolakes. Water demand data 
used in this study are available upon request from Y.W. (wada@iiasa. 
ac.at). BWS data®* used in this study are available online at https://www. 
wri.org/aqueduct. GE data” used in this study are available online at 
https://info.worldbank.org/governance/wgi/#home. Data on hydro- 
political tensions for transboundary river basins” used in this study are 
available online at https://transboundarywaters.science.oregonstate. 
edu/content/transboundary-freshwater-spatial-database. Data for 
future projections of population count’ used in this study are available 
online at ftp://ftp.pbI.nl/hyde/SSPs/SSP2/zip/. Data for future projec- 
tions of GDP“ used in this study are available online at http://www.cger. 
nies.go.jp/gcp/population-and-gdp.html. Data for future projections 
of temperature and precipitation” used in this study are available 
online at https://climexp.knmi.nl. An online interactive visualization 
of the water tower index and vulnerability is available at https://www. 
nationalgeographic.com/environment/perpetual-planet/. 


Code availability 


The code developed for the WTI calculations performed for this study 
are publicly available in a Github repository at https://github.com/ 
mountainhydrology/pub_ngs-watertowers. 
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Extended Data Fig. 1|See next page for caption. 
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Extended Data Fig. 1| Concept and global spread of WTUs. a, The WTUs are 
defined as the intersection of Earth’s major hydrological basins’ and mountain 
ranges® meeting predefined thresholds for ice volume or snow persistence 

(see Methods section). One WTU can consist of (parts of) multiple mountain 
ranges and one mountain range can be part of multiple WTUs. The example 
shows two hydrological basins in North America: the Great Basin (red outline) 
and California (blue outline). The striped areas indicate two mountain ranges: 
the Sierra Nevada and the Cascade Range. The intersection of the hydrological 
basins and the mountain ranges defines the WTUs (dark colours). For example, 
the Great Basin WTU is defined as the portion of the Sierra Nevada that is part of 


the Great Basin hydrological basin (dark red), and the California WTU is defined 
as the portion of the Sierra Nevada that is part of the California hydrological 
basin as well as a portion of the Cascade Range that is part of the California 
hydrological basin (dark blue). The WTU’s dependent area (light colours) is 
defined as the sub-basins within the hydrological basin that are overlapping the 
WTUor downstream of sub-basins overlapping the WTU. b-e, The WTUs (dark 
colours) and associated WTU basins (light colours) for all78 WTUs and WTU 
basins, grouped by continents: North America (b), Europe (c), Asiaand Oceania 
(d), South America (e). Number labels indicate the WTU IDs (see Extended Data 
Tables 1, 2 for corresponding names). 


oO =< 


b North America South America 

— 

1 
ue 
= 
oO 

a 0 0.1 0.2 0.3 0.4 0 0.1 0.2 0.3 0.4 
ne} 
£ 
2 | 

C 

oO 

= 

0) 

Q 


0.1 0.2 0.3 0.4 


0.4 0 
Supply Index (Sl) (-) 


Extended Data Fig. 2|Sland DI.a, The WTUSI (blue colourscale) and Background colour gradient indicates water tower importance (thatis, darker 
downstream DI (brown colourscale) for all78 WTUs and WTU basins. b, Supply shades represent higher SI and DI values). Points are labelled with WTU IDs (see 
index (SI) and demand index (DI) for each WTU grouped per continent. Extended Data Tables 1, 2, Extended Data Fig. 1). 
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Extended Data Fig. 3| Annual precipitation and snow cover. a, Average annual precipitation between 2001 and 2017, resampled bilinearly to 0.05° resolution 
based on ERA5™. b, Average snow persistence between 2001 and 2017, resampled to 0.05° resolution based on MODIS MODIOCMI1’. 
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Extended Data Fig. 4 | Glacier ice volume and lake and reservoir volume. a, Total aggregated glacier ice volume per WTU. b, Total aggregated lake and 
reservoir water volume per WTU°. 
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Extended Data Fig. 5 | Water use for irrigation and industry. a, Average annual irrigation water use per 0.05 x 0.05° grid cell 2001-2014”. b, Average annual 
industrial water use per 0.05 x 0.05° grid cell 2001-2014”. 
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Extended Data Fig. 6 | Domestic water use and natural water demand. a, Average annual domestic water use per 0.05 x 0.05° grid cell 2001-2014™. b, Total 
aggregated average annual natural water demand 2001-2014 per WTU basin based on the Environmental Flow Requirement®**)?. 
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WTUs by WTI resulting from uncertainty in the weights of individual indicators 
(red), expressed as a percentage of 10,000 realizations of the WTI index 
calculation. 


Extended Data Fig. 7 | Sensitivity of WTU ranking to uncertainty in input 
data and indicator weights. Position change in ranking of WTUs by WTI 
resulting from uncertainty in input data (blue), expressed as a percentage of 
1,000 realizations of the WTIindex calculation. Position change in ranking of 


Extended Data Table 1| List of WTUs and the GMBA mountain ranges that are (partly) covered by each WTU, for North 
America and South America 


WTU 
ID 


Bl wo}]hy 


WTU Name 


Mississippi-Missouri 


California 
Great Basin 
N-America, Colorado 


Columbia and NW-US 


Fraser 


Pac. and Arc. Coast 


Saskatchewan-Nelson 
NW-Territ. and Nunavut 
Hudson Bay Coast 

Atl. Ocean Seaboard 


Mackenzie 


Rio Grande-Bravo 
Caribbean Coast 
Magdalena 
Orinoco 


Amazon 
La Plata 
S-America, Colorado 


Negro 

S-Arg., S-Atl. Coast 
Col.-Ecuad., Pac. Coast 
Peru, Pac. Coast 


N-Chile, Pac. Coast 
S-Chile, Pac. Coast 
La Puna Region 


Salinas Grandes 


GMBA mountain ranges (partly) covered by WTU 

Bighorn Mountains, Absaroka Range, Crazy Mountains, Lewis Range, Swan Range, Flathead Range, Wind 
River Range, Front Range, Medicine Bow Mountains, Gore Range, Sawatch Range 

Sierra Nevada, Cascade Range 

Sierra Nevada 

San Juan Mountains, Wind River Range, Front Range, Medicine Bow Mountains, Gore Range, Sawatch Range 


Absaroka Range, Lewis Range, Swan Range, Mission Range, Flathead Range, Purcell Mountains, Cabinet 
Mountains, Sawtooth Mountains, Teton Range, Wind River Range, Wallowa Mountains, Cariboo Mountains, 
Monashee Mountains, Selkirk Mountains, Coast Mountains, Rocky Mountains Calgary, Scrip Range, Cascade 
Range 


Coast Mountains, Skeena Mountains, Omineca Mountains, Cariboo Mountains, Monashee Mountains, 
Hazelton Mountains, Rocky Mountains Calgary, Scrip Range, Cascade Range 


Chugach Mountains, Kenai Mountains, Alaska Range, Coast Mountains, Aleutian Range, Kodiak and Afognak 
Island, Alexander Archipelago, Vancouver Island, Brooks Range, Saint Elias Mountains, Wrangell Mountains, 
Kilbuck Mountains, Talkeetna Mountains, Mackenzie Mountains, Wernecke Mountains, Selwyn Mountains, 
Pelly Mountains, Skeena Mountains, Stikine Ranges, Cassiar Mountains, Omineca Mountains, Hazelton 
Mountains, Cascade Range, Olympic Mountains 


Lewis Range, Rocky Mountains Calgary 
Baffin Island 

Torngat Mountains 

Torngat Mountains 


Mackenzie Mountains, Wernecke Mountains, Selwyn Mountains, Pelly Mountains, Stikine Ranges, Cassiar 
Mountains, Omineca Mountains, Rocky Mountains Calgary 


San Juan Mountains, Sawatch Range 

Sierra Nevada de Santa Marta, Cordillera Oriental Colombia Venezuela 

Sierra Nevada de Santa Marta, Cordillera Central Colombia, Cordillera Oriental Colombia Venezuela 
Cordillera Oriental Colombia Venezuela 


Cordillera Central Colombia, Cordillera Oriental Colombia Venezuela, Cordillera Central Ecuador, Cordillera 
Oriental Peru Bolivia, Cordillera Occidental Peru Bolivia Chile, Altiplano 


Cordillera Oriental Peru Bolivia 


Cordillera principal, Cordillera de Oliva, Cordillera de Ollita, Cerro de Ansilta, Central Volcanic Zone, Cordillera 
Frontal 


Cordillera principal, Northern Patagonian Andes 

Cordillera Patagonica Sur, Northern Patagonian Andes, Andes fueginos 
Cordillera Central Colombia, Cordillera Central Ecuador 

Cordillera Central Ecuador, Cordillera Occidental Peru Bolivia Chile 


Cordillera principal, Cordillera Occidental Peru Bolivia Chile, Sierra de la Punilla, Sierra de Tatul, Cordillera de 
Oliva, Cordillera de Ollita, Cerro de Ansilta, Central Volcanic Zone, Cordillera Frontal 


Northern Patagonian Andes, Cordillera Patagonica Sur, Andes fueginos, Cordillera principal, Cordillera Frontal 


Cordillera Oriental Peru Bolivia, Cordillera Occidental Peru Bolivia Chile, Altiplano, Central Volcanic Zone, 
Cordillera Frontal 


Central Volcanic Zone, Cordillera Frontal 
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Extended Data Table 2 | List of WTUs and the GMBA mountain ranges that are (partly) covered by each WTU, for Europe, Asia 


and Oceania 
WTU WTU Name 

28  Spain-Portugal, Atl. Coast 
29 = Spain, S-and E-Coast 
30 Ebro 
31 Garonne 
32 France, W-Coast 
33. Rhdéne 
34 France, S-Coast 
35 Rhine 
36 Po 
37 Italy, W-Coast 
38 Italy, E-Coast 
39 Danube 
40 Sweden 
41 Adr. Sea-Bl. Sea Coast 
42 Volga 
43 Ural 
44 Black Sea, N-Coast 
45 Caspian Sea Coast 
46 Scandinavia, N-Coast 
47 Russia, Barents Sea Coast 
48 Arc. Ocean Islands 
49 Iceland 
50 ~~ Gobi Int. 
51 Yellow River 
52 Tarim Int. 
53 Tibetan Plateau 
54 Yangtze 
55 Mekong 
56  Salween 
57 Irrawaddy 
58 Ganges-Bramaputra 
59 Sabarmati 
60 Indus 
61 Lena 
62 Siberia, N-Coast 
63 Yenisey 
64 Kara Sea Coast 
65 Ob 
66 Siberia, W-Coast 
67 Black Sea, S-Coast 
68 Caspian Sea, S-W-Coast 
69 Tigris-Euphrates. 
70 ~=Persian Gulf Coast 
71 ~~ Central Iran 
72 Helmand 
73. Farahrud 
74 Caspian Sea, E-Coast 
75 Amu Darya 
76 ~~ Syr Darya 
77 Lake Balkash 
78 New Zealand 


GMBA mountain ranges (partly) covered by WTU 
Pyrenees 
Pyrenees 
Pyrenees 
Pyrenees 
Pyrenees 
European Alps 
European Alps, Pyrenees 
European Alps 
European Alps, Pyrenees 
European Alps 
European Alps 
European Alps, Dinaric Alps 
Scandinavian Mountains, Jotunheimen 
Dinaric Alps, European Alps 
Ural Mountains 
Ural Mountains 
Greater Caucasus 
Greater Caucasus 
Scandinavian Mountains, Jotunheimen 
Ural Mountains 
Svalbard, Greenland Kalaallit Nunaat, Novaya Zemlya 
Iceland 
Haanhohiy Uul, Borohoro-Shan, Khrebet Dzhungarskiy Alatau, Khrebet Saur, Bogda Shan, Karlik Shan, Tulai Nanshan, 
Lenglong Ling, Datong Shan, Banyan Har Shan, Qionglai Shan, Anyemaqen Shan 
Alayskiy Khrebet, Ferganskiy Khrebet, Terskey Ala Too, Kokshaal Too, Borohoro-Shan, Narat Shan, Horo Shan, Eren 
Nganglong Kangri, Gangdise Shan, Nyaingentanglha Shan, Tanggula Shan, Tibetan Plateau 
Tanggula Shan, Banyan Har Shan, Ningjing Shan, Chola Shan, Shaluli Shan, Daxue Shan, Qionglai Shan, Tibetan 
Tanggula Shan, Ningjing Shan, Patkai Hills, Mishmi Hills, Tibetan Plateau, Yun Range 
Nyaingentanglha Shan, Tanggula Shan, Patkai Hills, Mishmi Hills, Tibetan Plateau 
Patkai Hills 
Gangdise Shan, Nyaingentanglha Shan, Tanggula Shan, Himalaya, Patkai Hills, Mishmi Hills 
Himalaya 
Himalaya, Ladakh Range, Pamir Mountains, Karakorum, Hindu Kush, Nganglong Kangri, Gangdise Shan, Malakand 
Baykal'skiy Khrebet, Khrebet Kodar, Verkhoyanskiy Khrebet, Khrebet Suntar Khayata 
Gory Putorana 
Haanhohiy Uul, Shopshal'skiy Khrebet, Kuznetskiy Alatau, Zapadnyy Sayan, Vostochnyy Sayan, Baykal'skiy Khrebet, 
Ural Mountains 
Khrebet Saur, Seminskiy Khrebet, Aygulakskiy Khrebet, Kuroyskiy Khrebet, Shopshal'skiy Khrebet, Kuznetskiy Alatau, 
Chukotskiy (Anadyrskiy) Khrebet, Koryakskiy Khrebet, Sredinnyy Khrebet, Verkhoyanskiy Khrebet, Momskiy Khrebet, 
Kuzey Anadolu Daglari / Pontus Mountains, Lesser Caucasus, Greater Caucasus 
Agri Dagi, Suphan Dagi, Kiih-e haye Sabalan, Alborz Mountains, Zagros Mountains, Lesser Caucasus, Greater 
Mercan Daglari, Hakkari Daglari, Suphan Dagi, Zagros Mountains 
Zagros Mountains 
Alborz Mountains, Zagros Mountains 
Hindu Kush 
Hindu Kush 
Hindu Kush, Alborz Mountains 
Zeravshan, Pamir-Alay, Turkestainskiy Khrebet, Gory Baysun Tau, Pamir, Karakorum, Hindu Kush, Malakand Range 
Turkestanskiy Khrebet, Pamir-Alai, Ferganskiy Khrebet, Chatkal'skiy Khrebet, Talas Alatau, Kyrgyz Ala Too, Terskey Ala 
Kyrgyz Ala Too, Kungey Ala Too, Ile Alatau, Borohoro-Shan, Dzhungarskiy Alatau, Narat Shan, Tien Shan 
Ruapehu, Rolleston Range, Two Thumb Range, Liebig Range, Ben Ohau Range, Young Range, Olivine Range, 


Extended Data Table 3 | Overview of WTU supply indicators used 


Indicator 


Precipitation 
contribution WTU/basin 


Inter-annual variability 
in precipitation 


Intra-annual variability 
in precipitation 


Precipitation 


WTU snow cover 


Inter-annual variability 
in snow cover 


Intra-annual variability 
in snow cover 


Snow 


Glacier ice storage 


Glacier water yield 


Glaciers 


Lake and reservoir 
storage 


Final supply index 


Symbol 


Pr 


Pyy 


Puy 


Syv 


Suv 


Gs 


Sl 


Input 


Average annual WTU precipitation sum 
(2001-2017): Pwru (km°) 


Average annual basin precipitation sum 
(2001-2017): Pas (km®) 


Annual WTU precipitation for individual 
years (2001-2017): Py (km*) 


Average monthly WTU precipitation sum 
(2001-2017): Pm (km*) 


Average annual WTU snow cover: S (-) 


Annual average WTU snow cover (2001- 
2017): Sy (-) 


Average monthly snow cover (2001-2017): 
Sm (-) 


Total glacier ice volume in WTU: Gy (km°) 


Average annual WTU precipitation sum 
(2001-2017): Pwru (km?) 


Average annual WTU precipitation sum 
(2001-2017): Pwru (km°) 


Average annual precipitation sum glaciated 
area (2001-2017): Patac (km?) 


WTU average annual glacier mass balance: 


B (km?) 


Total volume stored in lakes and reservoirs 
in WTUs: Si (km°) 


Average annual WTU precipitation sum 
(2001-2017): Pwru (km*) 


Equation 


Pt = Pwtu/ Paas 


Pyyv = 1 - ((max(Py)-min(Py)) / max(Py) 


Puy = 1 - ((max(Pm)-min(Pm)) / max(Pm) 


P = 0.5*(Pyv+Pmv)*Pr 


Sw = 1 - ((max(Sy)-min(Sy)) / max(Sy) 


Suv = 1 - ((max(Sm)-min(Sm)) / max(Sm) 


S = 0.5*(Syv+Smv)*St 


Gs = Gv / (Gv+Pwrtu) 


Gy = (Petac — B) / (Pctac — B + Pwrtu) 


G = (Gst+Gy) / 2 
L=S_/ (Si+Pwrtu) 


(P+S+G+L)/4 


Reference 


32 


32 


32 


48 


32 


32 


50 


32 
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Extended Data Table 4] Overview of WTU demand indicators used 


Indicator Symbol Input Equation Reference 


Domestic demand Doom Average annual downstream domestic water use = (Dpom,m - WApomm) / Doomy o 
(2001-2014): Doomy (km?) 


Average monthly downstream domestic water use 
(2001-2014): Doom,m (km3) 


Average monthly P-ET (2001-2017) for downstream 
cells with domestic demand above threshold: WApbom,m 
(km*) 

Threshold is 1*10° km? per 0.05° grid cell 


Industrial demand Dino Average annual downstream industrial water use > (Dino,m — WAino,m) / Dinoy $4 
(2001-2014): Dinoy (km*) 


Average monthly downstream industrial water use 
(2001-2014): Dino,m (km) 


Average monthly P-ET (2001-2017) for downstream 
cells with industrial demand above threshold: WAino,m 
(km?) 

Threshold is 1*10° km? per 0.05° grid cell 


Irrigation demand Dirr Average annual downstream irrigation water use (2001- = (Dirnm— WAirrm) / Dirry et 
2014): Dirry (km?) 


Average monthly downstream irrigation water use 
(2001-2014): Dirrm (km’) 


Average monthly P-ET (2001-2017) for downstream 
cells with irrigation demand above threshold: WAirr.m 
(km?) 

Threshold is 1*10° km? per 0.05° grid cell 


Natural demand Dnart Average annual Environmental Flow Requirement at © (Dnatym — WAnatym) / Dnaty B1.2.64 


river basin outlet (2001-2014): Dnat,y (km?) 


Average monthly Environmental Flow Requirement at 
river basin outlet (2001-2014): Dnat,m (km*) 


Average monthly P-ET for downstream basin (2001- 
2017): WAnat,m (km?) 


Final demand index DI Dirk + Dino + Doom + Dwar) / 4 
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The partial pressure of CO, in the oceans has increased rapidly over the past century, 


driving ocean acidification and raising concern for the stability of marine 
ecosystems! >. Coral reef fishes are predicted to be especially susceptible to end-of- 
century ocean acidification on the basis of several high-profile papers** that have 
reported profound behavioural and sensory impairments—for example, complete 
attraction to the chemical cues of predators under conditions of ocean acidification. 
Here, we comprehensively and transparently show that—in contrast to previous 
studies—end-of-century ocean acidification levels have negligible effects on 
important behaviours of coral reef fishes, such as the avoidance of chemical cues from 
predators, fish activity levels and behavioural lateralization (left-right turning 
preference). Using data simulations, we additionally show that the large effect sizes 
and small within-group variances that have been reported in several previous studies 
are highly improbable. Together, our findings indicate that the reported effects of 
ocean acidification on the behaviour of coral reef fishes are not reproducible, 
suggesting that behavioural perturbations will not be a major consequence for coral 
reef fishes in high CO, oceans. 


The partial pressure of CO, in the oceans has increased from aver- 
age pre-industrial levels of around 280 ppatm to present-day levels 
of approximately 410 atm, driving a process known as ocean acidi- 
fication. End-of-century levels of CO, in the oceans are expected to 
reach 900-1,000 pLatm, exceeding what most marine species have 
experienced in the past 30 million years!”, and raising concerns over 
biodiversity loss and the stability of marine ecosystems’. 

Fishes have well-developed acid-base regulatory systems to 
maintain tissue pH, even when faced with partial pressure levels of 
CO, (peo) that exceed the end-of-century forecasts by 15 times 
(that is, 15,000 patm)*. Therefore, physiologists have historically 
considered fishes to be robust to near-future CO, levels’*. Notably, 
a number of highly publicized studies have reported detrimental 
effects of elevated CO, levels on the sensory systems and behaviours 
of fishes*’, with coral reef fishes appearing to be the most sensitive 
despite experiencing large daily and seasonal CO, fluctuations in 
nature (for example, 100-1,300 pratm)””°. Indeed, CO, levels of 
around 1,000 patm appear to alter or impair all of the sensory sys- 
tems and associated behaviours of coral reef fishes studied to 
date”". Reported effects across a range of life stages include alter- 
ations in olfaction, hearing, vision, learning, behavioural lateraliza- 
tion, activity levels, boldness, anxiety and susceptibility to 
predation”. This literature has contributed to dire predictions for 
fish populations and marine ecosystems that are at risk of ocean 
acidification”. 


Although the reported effects of ocean acidification on the sensory 
systems and behaviours of fishes are considerable, there are substantial 
disparities among studies and species, even when methodological 
approaches are similar’*”’. This discrepancy is surprising given that 
many of the most prominent studies that describe detrimental effects 
of ocean acidification on fish behaviour report exceptionally low vari- 
ability and large effect sizes*>?*"”, which should maximize the prob- 
ability of successful replication’’. Moreover, the proposed mechanism 
that underlies the sensory impairments (interference with the function 
of the neurotransmitter GABA, (y-aminobutyric acid) in the brain”) is 
reported to transcend animal phyla" and therefore should apply to 
all species of fish. 

Inresponse to the ‘reproducibility crisis’ that affects many scientific 
disciplines”, the scientific community is demanding that studies are 
rigorously conducted and independently replicated before drawing 
broad conclusions and implementing management measures, particu- 
larly when describing widespread phenomena of global importance”. 
Establishing a robust and independently replicated database of the 
effects of ocean acidification on fishes is essential to gain a reliable 
understanding of the consequences of climate change on marine eco- 
systems”, 

To this end, we commenced a three-year research program in 2014 
to quantify the effects of end-of-century ocean acidification on the 
sensory and behavioural ecology of coral reef fishes. Our objectives 
were to replicate and build on some of the most prominent studies in 
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Fig. 1| Widespread avoidance of predator chemical cues in coral reef 
damselfishes exposed to present-day and end-of-century levels of CO,. 
a, P.amboinensis.b, C. atripectoralis.c, D. aruanus.d, P. moluccensis. 

e, A. polyacanthus.a-e, Percentage of time (mean +s.e.m.) that fishes spent 
in water containing chemical cues of a predator (C. cyanostigma (a-d) or 

C. urodeta (e)) during two-current choice flume tests at the Lizard Island 
Research Station in 2014 (a-d, sub-adults and adults) and at the Australian 


this field to understand the diversity in behavioural responses within 
and across species. Notably, we aimed to enhance transparency and 
reduce methodological biases” by ensuring that our methods were fully 
documented and reproducible, and that raw data and videos of behav- 
ioural trials were publicly available and open to external review”. 


Responses to chemical cues from predators 


In fishes, the reversal of chemical cue preferences is one of the most 
alarming effects of elevated CO, reported to date. Initial studies on this 
phenomenon used choice flumes and reported that larval clownfish 
(Amphiprion percula) and damselfish (Pomacentrus wardi) exposed to 
elevated CO, (850-1,050 patm for 3-11 days) chose to spend a remark- 
able 90-100% of their time in water containing the chemical cues of 
predators (Cephalopholis cyanostigma or Pseudochromis fuscus) 
instead of avoiding these cues like conspecifics that were maintained 
at present-day CO, levels (0O-10% of time in predator cues)**. These 
reports concluded that prey species will be attracted to their predators 
in a high CO, world. Many reports of cue preference reversal in coral 
reef fishes have since been published, including for fishes obtained 
from natural CO, seeps” and those experiencing transgenerational 
acclimation to elevated CO, under laboratory conditions”. 

Our experiments used established protocols in choice flume meth- 
odology (see Methods), including video footage of experiments (with 
pre-trial notes indicating the treatment history of each fish; see https:// 
youtu.be/iHOw7Waztjo) and the use of automated tracking software. 
We quantified the effects of elevated CO, on predator cue avoidance 
across 3 consecutive years in 560 individuals from 6 species of poma- 
centrid coral reef fishes (Acanthochromis polyacanthus, Chromis atrip- 
ectoralis, Dascyllus aruanus, Dischistodus perspicillatus, Pomacentrus 
amboinensis and Pomacentrus moluccensis). Experiments covered a 
range of temperatures (Extended Data Table 1), CO, acclimation pro- 
tocols were kept consistent with previous studies (4 or more days at 
around 1,000 patm)*>” and four of our study species (A. polyacanthus, 
D. aruanus, P. amboinensis and P. moluccensis) have previously been 
reported to exhibit severe behavioural impairments following exposure 
to high CO, levels!*”>”°, 

All four species of adult and sub-adult wild fishes tested in 2014 (C. 
atripectoralis, D. aruanus, P. amboinensis and P. moluccensis) signifi- 
cantly avoided the predator cue (C. cyanostigma) in both control and 
high CO, groups (Fig. 1la-d and Extended Data Table 2; pooled across 


Institute of Marine Science in 2015 (e, juveniles). Control fish (maintained in 
water containing around 410 patm CO,) inclosed grey circles, CO,-exposed fish 
(maintained in water containing around 1,000 patm CO,) inopen blue circles 

(n of biologically independent animals are shown in the figure panels). Data 
were excluded between 10 min and 13 min for the predator cue switch. See 
Extended Data Table 2 for statistics. Fish illustrations by E. Walsh and S. Rowan. 


all species, n=164, all P> 0.21). The following year (2015), we detected 
aCO, treatment effect for A. polyacanthus juveniles reared in captivity 
(Extended Data Table 2; n = 100, P< 0.001): control fish spent 39 + 2% 
(model estimate +s.e.) of their time in the predator cue (Cephalopholis 
urodeta) whereas fish acclimated to high CO, levels spent 54 + 3% of 
their time in the predator cue (Fig. le). This CO, treatment effect was 
not replicated in wild A. polyacanthus of any life stage in 2016 (Fig. 2a, 
band Extended Data Table 2; n=94, P=0.86), nor were there any treat- 
ment effects for any of the life stages of D. aruanus (n= 83, P=0.09) or 
D. perspicillatus (n = 119, P= 0.30) tested in that same year (Fig. 2c—e 
and Extended Data Table 2). 

Overall, we detected a modest CO, treatment effect (no avoidance 
of predator cue) in one of six species in one of the two years in which 
that species was examined. These findings demonstrate that none of 
the coral reef fishes that we examined exhibited attraction to predator 
cues when acclimated to high CO, levels, in contrast to previous reports 
onthe same and other species*>”’, 

To investigate the marked disparity between our findings and previ- 
ous reports for coral reef fishes, we took subsets of our choice flume 
data (n= 247 control, n= 239 high CO,; 4 min per trial) to replicate the 
4-min analysis approaches used previously (that is, around 9-min trials, 
using 2 min of data before and after the cue switch)**"*"”*52”, We then 
used bootstrapping simulations to compare our data with previous 
datasets (Supplementary Information). On the basis of 10,000 boot- 
strap samples per scenario, we demonstrate using our large dataset 
that the results reported previously for coral reef fishes are highly 
improbable (probability of 0 out of 10,000): our frequency histograms 
of bootstrapping outputs show no evidence of CO, effects on chemical 
cue avoidance (Fig. 3a—c), and the within-group variance reported in 
previous studies is typically lower than what is statistically realistic 
(Fig. 3d-f). 


Activity levels 

Coral reef fishes exposed to end-of-century CO, levels have been stated 
to exhibit up to 90-fold higher levels of activity”, prompting sugges- 
tions that these changes could underlie the higher mortality rates 
reported for fish that have been briefly exposed to high CO, and then 
placed onto patch reefs in the wild under present-day CO, conditions’. 
Notably, most activity measurements (for example, distances moved) 
from coral reef fishes have not used video footage but have been made 
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d Switch 


Fig. 2|Damselfishes avoid predator chemical cues tothe same degree 
when exposed to present-day or end-of-century CO, levels irrespective 

of life stage. a,b, A. polyacanthus.c, D. aruanus.d, e, D. perspicillatus. 

a-e, Percentage of time (mean+s.e.m.) that fishes from early life stages and 
later life stages (including mid and late life stages) spent on one side of atwo- 
current choice flume during experiments at the Lizard Island Research Station 
in 2016. Control fish (maintained at approximately 520 pratm) inclosed grey 


using direct manual observations, either by SCUBA divers or by an 
observer counting the number of gridlines crossed by fish in aquaria>”®. 

We filmed 582 individuals from 6 species across 3 years and quanti- 
fied swimming activity in behavioural arenas using automated track- 
ing software. Activity levels were assessed in adults and sub-adults 
of 5 species in 2014, with 3 species showing no detectable effects of 
CO, treatment (C. atripectoralis, P. amboinensis and P. moluccensis; 
Fig. 4c—e and Extended Data Table 3; pooled across all species, n=126, 
P>0.08). We found some evidence that activity was affected by high 
CO, in D. aruanus, for which an interaction between CO, treatment and 
standard length suggested that activity was elevated by approximately 
59-92% in smaller individuals (<37 mm standard length) in high CO, 
levels (Fig. 4b, Extended Data Fig. 1a and Extended Data Table 3; n=46, 
P=0.03). In A. polyacanthus, activity levels were increased by around 
50% (P=0.009) in fish acclimated to elevated CO, levels after control- 
ling for astrong main effect of standard length (Fig. 4a, Extended Data 
Fig. 1b and Extended Data Table 3; n=16, P< 0.001). 

When we extended our experiments in 2015 using captive-reared 
juvenile A. polyacanthus with greater sample sizes and longer-duration 
trials (Supplementary Information), the effect of CO, on activity disap- 
peared (Extended Data Table 3; n = 66, P= 0.1). There was, however, a 
weak interaction (P= 0.04); activity decreased in the high CO, fish (but 
not controls) with increasing body size (Fig. 4a, Extended Data Fig. 1c 
and Extended Data Table 3). In 2016, we conducted additional tests 
of activity in wild fish across various life stages and found no effects 
of CO, nor any interactions with body size in any of the three species 
(n=122D. perspicillatus, n=112 A. polyacanthus, n= 94 D. aruanus; all 
CO, main effects P> 0.24; Fig. 4 and Extended Data Table 3). 
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circles, CO,-exposed fish (maintained at around 1,090 patm) in open blue 
circles. Fish were given 40 min to habituate to the flume (during which 

time their activity was quantified; see Fig. 4). Predator chemical cues 

(C. cyanostigma) were introduced to one side of the flume for 20 min and then 
switched to the other side for another 20 min. n of biologically independent 
animals are shown in the figure panels. See Extended Data Table 2 for statistics. 
Fish illustrations by E. Walsh. 


Overall, we found that fish exposed to high CO, did not exhibit con- 
sistently elevated activity levels compared with conspecifics under 
control conditions (Fig. 4). Rather, we found that activity levels were 
highly variable among individuals, increasing the risk of type-I errors 
in experiments using small sample sizes’, and possibly in large-sample 
experiments that rely on human observation rather than automated 
video analysis” **. 


Behavioural lateralization 


Atendency to favour the left or right side during behavioural activities 
(that is, behavioural lateralization) is thought to be an expression of 
functional asymmetries of the brain; this is important for tasks such as 
schooling and predator avoidance”. Elevated CO, has been reported 
to reduce or abolish behavioural lateralization in fishes”, presum- 
ably asa result of brain dysfunction”. Population-level lateralization is 
present whena group of individuals collectively exhibits a side bias (the 
mean number of turns to one side significantly exceeds 50%), whereas 
individual-level lateralization is present when more individuals within 
a tested group exhibit a side bias than expected by chance (based on 
a binomial distribution with a = 0.5). Both types of lateralization are 
independent of each other, but are not mutually exclusive (see Methods 
and Supplementary Information for details). 

Using a standard detour test in a double T-maze, we quantified 
the effects of elevated CO, levels on behavioural lateralization using 
175 fishes across four species in 2014 (C. atripectoralis, D. aruanus, 
P. amboinensis and P. moluccensis). None of the species exhibited 
population-level lateralization under control conditions (Extended 
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Fig. 3 | Bootstrapping data simulations of predator chemical cue avoidance 
and within-group variance. Bootstrapping data simulations reveal that fish 
avoid predator chemical cues regardless of whether they are acclimated to 
present-day or end-of-century CO, levels (a-c), and the within-group variance 
in many previous studies is lower than statistically reasonable (d-f). 

a-c, Frequency outputs from bootstrapping simulations of the mean 
percentage of time spent in water containing predator cues when n=10 (a), 
n=20(b) orn=60(c) fish were sampled from each of the control (grey) and 
high CO, (blue) treatment groups (total nin sampled dataset: 247 control, 

239 high CO,; sample sizes represent biologically independent animals). The 
frequency distributions fall to the left of 50% (dashed vertical line) in both 
treatment groups, indicating similar avoidance of predator chemical cues 
under control and high CO, conditions. This is markedly different from 
previous reports of major effects of high CO, levels on predator and alarm cue 
avoidance in coral reef fishes (examples presented in coloured circles, selected 


Data Fig. 2a-d and Extended Data Table 4), and only C. atripectoralis 
showed slight population-level lateralization under high CO, (P= 0.047; 
Extended Data Table 4). Three species (C. atripectoralis, D.aruanus and 
P. moluccensis) exhibited no individual-level lateralization under con- 
trol conditions, which remained unchanged under high CO, conditions 
(Extended Data Fig. 2a—c and Extended Data Table 4). A treatment effect 
was detected for individual-level lateralization in P amboinensis, with 
the high CO, group displaying reduced individual-level lateralization 
compared with controls (Extended Data Fig. 2d and Extended Data 
Table 4). However, this effect was no longer present when a subset of 
the same individuals was retested 7-8 days later (n =15 control, n=15 
high CO,; Extended Data Fig. 2e and Extended Data Table 4). Although 
our sample sizes were comparable to many similar studies (for example, 
ref. !”), our inconsistent findings for P. amboinensis are likely to bea 
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Variance around mean time in predator cue 
to matchthe group sample sizes presented in figure panels: closed circles, 
control; open circles, high CO,). d-f, Frequency histograms (light grey) of the 
associated variance around the means from bootstrapping simulations 
presented ina-c (control and high CO, fish pooled for simplicity). Also 
presented are results of previous studies of coral reef fish (variance around the 
group mean, where similar groups were combined for simplicity) that have 
used choice flumes to examine chemical cue preferences. a, d, Dark-blue circle, 
data from ref."*. b, e, Dark-blue circle, data from ref. ” (this circle overlaps with 
the red circle inb); red circle, data from ref.°; light-blue circle, data from ref. ””. 
c,f, Blue circle, data from ref. *; red circle, data from ref.”°. e, f, Black circles 
indicate references that did not examine the effects of high CO, and/or 
predator or alarm cue avoidance and thus do not appear ina-c.e, Left black 
circle, data from ref. °; right black circle, data from ref.™. f, Black circle, data 
fromref.*”. For additional details, see Supplementary Information. 


consequence of low statistical power in a behavioural test that exhibits 
high inter-individual variability’’ (Extended Data Fig. 2). 

We increased statistical power in 2015 when behavioural lateraliza- 
tion was tested in wild and captive-reared A. polyacanthus (n=120 con- 
trol,n=104 high CO,), aspecies for which impairments in lateralization 
caused by high CO, levels have been reported”. In contrast to previ- 
ously reported results, we found no effect of CO, levels on behavioural 
lateralization: A. polyacanthus exhibited individual-level lateralization 
and no population-level lateralization, both under control and high 
CO, conditions (Extended Data Fig. 2f and Extended Data Table 5). On 
the basis of the previous studies that have reported that elevated CO, 
levels impair visual acuity”°”’, we slightly offset the barrier at one end 
of the lateralization arena, creating a shorter path around the barrier to 
the left. We predicted that fish under high CO, levels would not visually 
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Fig. 4 | Widespread similarities in the activity levels of six species of coral 
reef damselfish regardless of whether acclimated to present-day or end-of- 
century levels of CO,.a-f, Activity levels (s min“) after acclimation to control 
(around 450 ppatm; closed grey circles) or end-of-century (about 1,000 patm; 
open blue circles) levels of CO,. Mean values for individual animals are shown 
(small symbols). The large symbols and error bars represent the mean + 95% 
confidence intervals for each group. Data for A. polyacanthus (a) and 

D. aruanus (b) were collected across multiple years (indicated at the top 


detect the shortcut as strongly as control fish. By contrast, we found 
that fish from both treatment groups exhibited a preference for the 
shorter path (Extended Data Fig. 2g and Extended Data Table 5). 


Conclusions and implications 


Here we presenta multi-species, multi-year and multi-life-stage exami- 
nation of the sensory and behavioural impairments that have been 
reported for coral reef fishes under end-of-century levels of CO,, thus 
answering an international call for comprehensive replication studies 
onissues of global importance”. Notably, we took great care to enhance 
transparency by systematically documenting our experiments and 
providing raw data and analysis code. In contrast to previous studies on 
the same and closely related species, we found no consistent detrimen- 
tal effects of end-of-century CO, levels on the avoidance of predator 
chemical cues, activity levels or behavioural lateralization. Although 
CO, emissions are an environmental threat*”, the catastrophic projec- 
tions for fish sustainability based on CO,-induced behavioural impair- 
ments”"3 must be reassessed in light of our findings. 

We went to great lengths to match the species, life stages, location 
and season of previous studies, yet the discrepancies in findings were 
considerable. This was most apparent for the responses of fish to preda- 
tor chemical cues, for which previous studies have reported extreme 
effect sizes (in which control fish spent <10% of their time in predator 
cues compared with >90% of time for fish under high CO,; Fig. 3a—c) 
with exceedingly low variability around the group means (Fig. 3d-f). 
The research community in the field of ocean acidification and coral 
reef fish behaviour has remained small, and the study systems are often 
remote and expensive to access, both of which have precluded inde- 
pendent assessments of previous findings. Small sample sizes and 
other methodological or analytical weaknesses” in previous studies 
could potentially explain the discrepancies between our results and the 
majority of articles that have reported minor impacts (small effect sizes) 
of CO, on fish behaviour. However, we cannot reconcile our findings 
with those that show extremely large effect sizes and small within-group 
variance in experiments with large sample sizes (Fig. 3). Inter-individual 
variation enables the persistence of populations and species and is a 
fundamental biological phenomenon on which selection acts; results 
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of each panel), whereas data for C. atripectoralis (c), P.amboinensis (d) and 

P. moluccensis (e) were collected in 2014 and data for D. perspicillatus (f) were 
collected in 2016. n numbers along the bottom of the figure panels represent 
biologically independent animals. Note that there were some statistically 
significant (two-tailed tests), context-dependent effects of CO, treatment 
for A. polyacanthus and D. aruanus, including interactions with body size 
(see Extended Data Fig. 1; statistics are included in Extended Data Table 3). 


showing negligible variation (particularly for behaviours that are inher- 
ently variable) should be viewed with caution (see Supplementary 
Information). 

On the basis of our findings on more than 900 wild and captive- 
reared individuals of 6 species across 3 years, we conclude that accli- 
mation to end-of-century levels of CO, does not meaningfully alter 
important behaviours of coral reef fishes. Reasonably large sample 
sizes and consistent results across species, locations, life stages and 
years suggest that the probability of false-negative results (type-II 
errors) in our study is low. Given the importance of these issues to 
the management of coral reefs and other aquatic ecosystems”, we 
encourage further replication of previous studies using the transpar- 
ent and objective approaches described here (for example, video 
footage with pre-trial notes, complete data and code archiving)” ”’. 
Only then will the research community be equipped to reach a con- 
sensus on whether end-of-century ocean acidification could have 
direct effects on the behaviour of fishes. Nonetheless, it should be 
firmly emphasized that there is strong evidence that increasing 
atmospheric CO, is causing ocean warming, which can profoundly 
affect marine fishes”. 
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Methods 


Experiments were conducted across 3 years (2014-2016), at 2 locations 
in Australia (the Lizard Island Research Station (LIRS) and the Austral- 
ian Institute of Marine Science (AIMS) in Townsville), and ona total of 
more than 900 individuals from 6 species across an ontogenetic range. 
The experimental designs and CO, dosing systems (described below) 
followed best practices for ocean acidification research® ~. For all 
experiments, fish were given at least 4 days to acclimate to the CO, 
treatment before trials commenced. Although an acclimation period 
of 4 days is short, this duration was chosen because it has been reported 
to be sufficient to maximize behavioural and/or sensory impairments 
in fishes>”’. Fish were placed in the two treatment groups at random. 
Other aspects of water chemistry (that is, the water supply used and 
the temperature it was kept at), lighting and feeding were kept constant 
among replicate tanks across the two acclimation treatments. Juvenile 
fish were mostly used in the experiments but when adult fish were 
used, we did not determine their sex in order to minimize handling. The 
sample sizes used in each experiment were based on previous studies 
and fish availability. Complete blinding regarding CO, treatment was 
not possible as the CO, dosing system was visible (both visually and 
auditory) to any observer physically present during the experiments. 
However, all activity and predator cue avoidance experiments were 
recorded on video and analysed using automated tracking software. 
Lateralization experiments could not be tracked using automated 
tracking software but were scored in real-time. A detailed descrip- 
tion of the methods is included in the Supplementary Information. 
All experiments were conducted in compliance with relevant ethical 
regulations under approval from the James Cook University Animal 
Ethics Committee in association with the AIMS (permit A1924). 


Animals and holding conditions 

LIRS August 2014. Sub-adult and adult wild fishes (humbug dascyl- 
lus (D. aruanus),n=46; Ambon damsel (P. amboinensis), n=43; lemon 
damsel (P. moluccensis), n= 49; black-axil chromis (C. atripectoralis), 
n=43; and spiny chromis (A. polyacanthus), n=16) were collected from 
around Lizard Island at the northern end of the Great Barrier Reef, 
Australia (14° 40’ S, 145° 28’ E), by SCUBA divers using hand and/or 
barrier nets and spray bottles of clove oil anaesthetic (mixed 1:4 with 
ethanol). To produce predator chemical cues, predatory blue-spotted 
rock cods (C. cyanostigma; n= 24) were collected using hook and line. 
All fishes were transported in aerated seawater to LIRS, where they 
were placed in tanks with flow-through seawater (35 PSU) at ambient 
temperature (Extended Data Table 1). The damselfishes were divided in 
approximately even numbers between eight identical tanks (251 each; 
31 min“ flow-through). C. cyanostigma were divided in even numbers 
between two identical tanks (200 | each; 121 min™ flow-through) and 
fed pieces of sardine (Sardinops sagax) every 2-3 days. 

After 1-2 days in captivity, the CO, of half of the tanks (including one 
of the C. cyanostigma tanks) was gradually increased to 945 +117 patm 
(mean +s.d.) (DH;ora Of around 7.72, calculated using previously pub- 
lished constants*°”’; Extended Data Table 1) over 24 husing a CO, dosing 
system (pH stat Computers, Aqua Medic) connected to solenoid valves 
that regulate the administration of 100% CO, gas (as previously 
described*’). Although 24 h may seem a short duration over which to 
increase CO, to end-of-century levels, fish have a well-developed phys- 
iological capacity to endure much larger and/or quicker changes in Poo 
levels®*’. In addition, some previous studies have reported that fish 
were simply transferred to end-of-century p,. treatments rather than 
using a gradual change””***! and others did nat report how fish were 
transferred to high p., levels**”. The other half ofthe tanks remained 
at ambient CO, levels Of 406 + 21 patm (pH, o. Of approximately 8.04; 
Extended Data Table 1). Levels of CO, in each tank were checked twice 
daily using a handheld CO, meter (GMT 222, Vaisala) connected to an 
aspiration pump (Vaisala) and asubmerged gas-permeable PFTE probe 


(Qubit Systems) as described previously”. The CO, meter was factory- 
calibrated by Vaisala before experiments. Water samples (60-ml samples 
of water with 30 pl of mercury chloride to poison any microorganisms) 
were taken at 10 different points throughout the experiment for sub- 
sequent measurements of total alkalinity (Extended Data Table 1). Fish 
were fed to satiation 1-2 times per day with a commercial pellet food, 
but food was withheld for around 12 h before experiments. Tanks were 
cleaned every 3-4 days. Individual fish were reused for each of the three 
response variables that we measured (activity, behavioural lateraliza- 
tion and predator chemical cue avoidance) in a randomized order. At 
the end of the experiments, fish were released at their site of capture. 


AIMS May and June 2015. Juvenile spiny chromis (A. polyacanthus) 
(age, 3-14 days after hatching, 0.019 + 0.015 g (mean +s.d.) initial wet 
weight, 9.1 + 2.3 mm initial standard length) were obtained from the 
Reef HQ aquarium in Townsville, Australia (total n=1,494). In addition, 
groups of wild A. polyacanthus juveniles (10-15 days after hatching) 
from four distinct schools (four breeding pairs) were corralled into 
clear containers by SCUBA divers at depths of 8-10 m at Davies Reef 
(18.8238° S, 147.6429° E) in April 2015 (n = 481 collected). Fish were 
transported in aerated seawater to AIMS, where they were placed in 
25-I tanks with seawater recirculating (around 3.5 | min”) to one of 
four independent 200-I sumps, which themselves were continuously 
flushed with fresh seawater (4-7 | min”). Subsets of fish from Reef HO 
were used for assessments of predator cue avoidance, activity levels 
and behavioural lateralization, whereas wild fish were only used in be- 
havioural lateralization experiments. Four wild predatory fish (flagtail 
grouper (C. urodeta)) were freighted to AIMS and split evenly between 
two tanks after being caught from the northern Great Barrier Reef by 
Cairns Marine. The effluent water from the grouper tanks went straight 
to the drains to ensure that the A. polyacanthus did not habituate to 
predator chemical cues. C. urodeta were fed freshly killed juvenile A. 
polyacanthus every 1-2 days as previously described®. 

After at least 24 h to recover from transport, the CO, of half of the A. 
polyacanthustanks (n=10) and one of the C. urodeta tanks was gradually 
increased to1,021+156 atm (mean +s.d.) (pH,.14, of around 7.70; Extended 
Data Table 1) over 24 h using a CO, dosing system (pH stat Computers, 
Aqua Medic) connected to solenoid valves that regulate the administra- 
tion of 100% CO, gas into two of the partial-recirculation sump systems. 
The remaining tanks (n=10 for A. polyacanthus and n=1 for C. urodeta) 
were kept at ambient CO, levels (428 + 13 patm, pH,,¢., of around 8.03; 
Extended Data Table 1). Three large air stones in each sump ensured that 
the water remained well mixed and maintained dissolved oxygen at >90% 
air saturation. The CO, levels of the holding tanks were checked every 
1-4 days using a LI-820 CO, Gas Analyzer (LI-COR). Fish were exposed to 
natural water temperatures for the region (quantified using thermal data- 
loggers sampling every 30 min; iButton, Maxim Integrated). Temperature 
decreased seasonally from 26.1+ 0.2 °C during the first week of acclima- 
tion (May 2015) to 24.8 +0.5 °C during the final week of experiments (June 
2015; Extended Data Table 1). Salinity was regulated through the AIMS 
SeaSim aquarium system (35.8 + 0.15 PSU). Water samples for alkalinity 
were takenas described above for LIRS 2014 (five samples per treatment, 
Extended Data Table 1). Fish were fed ad libitum 1-2 times per day using 
commercial aquaculture pellets crushed toa powder and/or Artemia spp. 
nauplii, but food was withheld for 12-18 h before experiments. Tanks were 
cleaned weekly. Individual fish were used once; thatis, for one of the three 
response variables that we measured (activity, behavioural lateraliza- 
tion or predator chemical cue avoidance). All fish used at AIMS in 2015 
were euthanized with an overdose of tricaine methanesulfonate (MS-222, 
around 500 mg”) at the end of the experiments, or at intermittent times 
during the experiments when they were euthanized to take precise length 
and weight measurements for another study”. 


LIRS January 2016. Wild fishes were collected from around Lizard Is- 
land, as described above for LIRS 2014. Adult predatory C. cyanostigma 


(n=15) were caught using hook and line, and three damselfish species 
were caught using clove oil spray and hand or barrier nets (sub-adult 
and adult D. aruanus (n= 96); juvenile, sub-adult and adult A. polya- 
canthus (n= 112); sub-adult and adult white damsel (D. perspicillatus; 
n=50)). Note that A. polyacanthus does not have a pelagic larval phase 
(see Supplementary Information). In addition, larval D. perspicillatus 
(n= 72) were caught near the end of their pelagic phase using estab- 
lished light-trapping techniques**. Fishes were placed in tanks with 
flow-through seawater at ambient temperature (Extended Data Ta- 
ble 1). The damselfishes were divided in approximately even numbers 
between 22 identical tanks that each received constant flow-through 
(one species per tank, 7-8 tanks per species; 10-251 eachand 1-31 min“ 
flow-through, depending on fish size). C. cyanostigma were divided in 
even numbers between four identical flow-through tanks (60 | each; 
31 min™ flow-through) and fed sardine pieces and freshly killed adult 
damselfish every 2-3 days. All tanks were provided with pieces of PVC 
piping to act as shelter for the fish. 

After 1-2 days in captivity, the CO, of half of the tanks (n = 11 dam- 
selfish tanks, n =2 C. cyanostigma tanks) was gradually increased to 
1,089 + 326 patm (mean + s.d.) over 24 h using a CO, dosing system 
as described above for LIRS 2014, while the other half of the tanks 
remained at ambient CO, levels of 521+ 93 patm (Extended Data Table 1). 
Levels of CO, in each tank were checked twice daily using the handheld 
Vaisala as described above for LIRS 2014. Damselfishes were fed to 
satiation 1-2 times per day witha commercial fish flake—saltwater slurry 
(TetraMin Tropical Flakes, Tetra), but food was withheld for around12h 
before experiments. Tanks were cleaned every 3-4 days. Individual fish 
were used once; the two measured response variables were obtained 
froma single, continuous behavioural trial (activity followed by preda- 
tor chemical cue avoidance). At the end of the experiments, fish were 
released at the approximate site of capture. 


Response to predator chemical cues 

LIRS 2014. Four species were examined for their responses to preda- 
tor chemical cues (P. amboinensis (standard length range, 23-53 mm), 
C. atripectoralis (standard length, 15-43 mm), D. aruanus (standard 
length, 16-63 mm) and P. moluccensis (standard length, 19-34 mm); 
sample sizes are provided in Fig. 1and Extended Data Table 2), using a 
two-current choice flume. The setup for the two-current choice flume 
followed established protocols*® (for details, see Supplementary In- 
formation). The fish inthe high CO, group had been acclimated to the 
CO, treatment for 5-16 days before commencement of experiments, 
while control fish had been held for 4-16 days. The choice flume was 
acustom-built, larger version (L x Wx H=580 x 260 x 280 mm’; water 
depth, 80 mm) of atwo-current choice flume used in previous studies. 
Detailed information onthe design and function of two-current choice 
flumes has been described previously* (for details, see Supplementary 
Information). C. cyanostigma was used to create predator chemical cues 
(see Supplementary Information for details). All trials in the choice 
flume were recorded using a computer with a webcam (Logitech HD 
Pro C920) positioned 45 cm above the choice arena. At the beginning 
of atrial, a paper note detailing the treatment history of the individ- 
ual fish was placed in view of the camera before the fish was placed 
into the centre of the choice arena within a bottomless mesh cylinder 
(70-mm diameter) for 1.5-2 min. This step was included to ensure that 
the fish had the opportunity to receive sensory input from both sides 
of the choice flume—one side flowing with unmanipulated water and 
the other side flowing with water containing the predator cue. After 
the settling period, the mesh cylinder was carefully lifted and the fish 
was allowed to select its position within the flume. After a further 8 min, 
the configuration of flow through each side of the flume was switched 
using a series of valves such that water containing the predator cue 
now flowed through the opposite side of the flume. The valves were 
positioned near the secondary header tanks and could be adjusted 
without visually or physically disturbing the fish. The fish was given a 


further 8 min to select its position in the flume with the new flow con- 
figuration before being removed and returned to its holding tank. The 
video files were analysed using tracking software (ViewPoint, Zebralab) 
to automatically quantify time spent in the flow of water (side of the 
flume) containing the predator cue. 


AIMS 2015. The general flume setup used at AIMS followed the design 
described above, withsome exceptions. Two choice flumes were used 
side-by-side under the view of asingle camera (Microsoft LifeCam HD 
5000, mounted around 45 cm above the tank) recording at 10 frames 
per second witha resolution of 1,280 x 720 pixels. To match the smaller 
size of the fish (compared with the fish of the LIRS 2014 dataset), we used 
choice flumes with an arena that was 90 mm long x 45 mm wide with 
a water depth of 22 mm (4.9 mms‘ water speed, around 135 ml min™ 
per current). We initially tested flumes built to the exact specifications 
of those used in previous papers***”°. However, we were unable to pro- 
duce laminar flow using this setup; both incoming streams of water 
mixed in the test section of the flume, meaning that the fish would not 
be able to make a choice between the different currents (https://youtu. 
be/jrtyc-rLGWc?t=705, see Supplementary Information for details). 

The fish (A. polyacanthus (standard length, 9-11 mm) from Reef 
HQ aquarium) were acclimated to their respective CO, conditions for 
6-13 days before being used in choice flume trials. The predator chemi- 
cal cue avoidance trials (n = 50 control, n= 50 high CO.) followed the 
same protocol as at LIRS 2014 (see above; total duration of 18 min), 
including the presentation of an explanatory note in front of the camera 
before each trial. C. urodeta was used to create predator chemical cues 
(see Supplementary Information for details). 


LIRS 2016. Three species across an ontogenetic range were examined 
for their responses to predator chemical cues at LIRS in January 2016 
(A. polyacanthus, D. aruanus and D. perspicillatus from early life stages 
(7.5-14.5 mm standard length) and later life stages (15.0-51.0 mm stand- 
ard length; sample sizes listed in Fig. 2and Extended Data Table 2). Five 
two-current choice flumes were used in parallel (one 610 mm x 200 mm, 
two 290 mm x 93 mm, and two 235 mm x 45 mm, for details see Sup- 
plementary Information). All trials in the choice flumes were recorded 
using acomputer with webcams (Logitech HD Pro C920, FireWire cam- 
era, Dragonfly 2, Point Gray; Microsoft LifeCam HD 5000 webcam) 
positioned 45-130 cm above the choice arenas (depending on camera 
type and flume size). Trials were executed ina similar manner as at LIRS 
in 2014. At the commencement of atrial, a paper note detailing the 
treatment history of each fish was placed in view of the relevant camera 
before the fish was placed into the centre of the choice arena (no mesh 
cylinder was used) of the flume. Unlike during the predator chemical 
cue trials described for LIRS 2014 and AIMS 2015, the fish were given 
40 minto settle inthe flumes with unmanipulated water running down 
both sides (that is, no predator cue) before the cue was added to one 
side for 20 min, before switching the predator cue to the other side for 
the final 20 min. C. cyanostigma was used to create predator chemical 
cues (see Supplementary Information for details). The video files were 
analysed using tracking software (ViewPoint, Zebralab) for subsequent 
analyses of activity levels (defined as seconds per minute spent swim- 
ming morethan 0.5 standard lengths per second) and time spent inthe 
side of the flume containing the predator cue. Anexample of a full day 
of flume trials can be found at https://youtu.be/iHOW7Wqztjo. 


Activity levels 

LIRS 2014. Eight tanks (2 x 4 arrangement) were used to moni- 
tor activity in five species (Extended Data Table 3). Each tank was 
220 mm x 140 mm x 140 mm (L x Wx H; water depth, 105 mm) and con- 
tained 3.21 of flow-through water (70 ml min”, using the same header 
tank system as described above for LIRS 2014). Each tank was equipped 
with a halved piece of 50-mm diameter PVC pipe standing on its end 
(height 50 mm), which provided a vertical structure for the fish to use as 
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shelter. A video camera (Panasonic HC-V130) was positioned 1m above 
the tanks to monitor fish activity at all times. At the commencement of 
each trial, a paper note detailing the treatment history of the fish was 
placed in view of the camera before introducing individual fish into 
each tank. The fish were then video-monitored for activity levels for 
27 min. Sample sizes for 2014 swimming activity trials are included in 
Fig. 4 and Extended Data Table 3. 


AIMS 2015. The two choice flumes described above for use at AIMS 
in 2015 were also used for separate assessments of the activity levels 
of captive-reared A. polyacanthus for the two acclimation treatments 
(n=28 fish from control; n=38 fish from high CO,; fish standard length, 
11.7 + 1.6 mm (mean + s.d.); Extended Data Table 3) in unmanipulated 
acclimation water (that is, no predator cue). For these trials, fish were 
transferred from their home tank (without air exposure) into a flume 
and recorded for 2 h (Microsoft LifeCam HD 5000, mounted around 
45 cmabove the flume). 


LIRS 2016. Activity trials were conducted in the choice flumes de- 
scribed above for LIRS 2016; activity levels were monitored for the 
first 40 min of the experimental trials before releasing any chemical 
stimulus into either side of the flume. Five flumes were used in parallel 
and the flume dimensions and water velocities are described above. 
Additional large adult A. polyacanthus (n=9 control, 9 high CO,) and 
D. aruanus (n= 6 control, 7 high CO,) were tested in white opaque tanks 
(43 x 32.5 cm’, water depth, 10 cm). Sample sizes are provided in Fig. 4 
and Extended Data Table 3. 


Behavioural lateralization 

LIRS 2014. A double-ended opaque plastic T-maze (39 x 29 x 20 cm’, 
LxWxH) was constructed to perform detour tests to examine behav- 
ioural lateralization injuveniles and adults of four species (P. amboin- 
ensis, control n= 21, high CO, n = 22; C. atripectoralis, control n= 26, 
high CO, n=17; D. aruanus, control n=19, high CO, n=21;P. moluccensis, 
control n = 29, high CO, n= 20). The double T-maze was a modified 
version of those used in experiments that have been described previ- 
ously*”“®, Individual fish were netted from their tanks and transferred 
immediately to the double-ended T-maze. Fish were given 1 min to settle 
inthe central channel of the T-maze before the trial commenced. Later- 
alization experiments consisted of an experimenter first manoeuvring 
the fish to the starting point of the channel and then coaxing it down the 
channel with perforated plastic paddles for 10 consecutive runs. Fish 
had to make a decision to turn left or right each time they reached the 
perpendicular barrier at the end of the channel. All lateralization tests 
were recorded on video (using an Olympus Tough TGI1 or a Panasonic 
Lumix DMC-FT4 camera). 


AIMS 2015. A double-ended T-maze (31 x 11 x 13 cm’*, L x Wx H) simi- 
lar to the maze described above was constructed to perform detour 
tests injuvenile A. polyacanthus. Wild-caught fish (10-33 mm standard 
length; control n=54; high CO, n=42) as well as captive-reared fish from 
Reef HQ aquarium (8-33 mm standard length; control n= 66; high CO, 
n= 62) were used. The lateralization trials at AIMS followed the method 
described above for LIRS with the exception that 20 rather than 10 
consecutive turns were recorded and the fish were given 2 min rather 
than1 min of settling time upon entrance to the arena. In addition, the 
barrier at one end of the central channel was offset by 5 mm to create 
a situation in which the path around the barrier was shorter if the fish 
turned left rather than right (rationale and further details are provided 
inthe Supplementary Information). 


Statistics 

General analyses. Time spent in predator cue and activity levels were 
quantified for each min of the behavioural trial for each fish using 
tracking software, which meant many repeat observations for each 


individual. However, three limitations prevented us from analysing the 
data over time. First, the effect of time was nonlinear. Second, the data 
were temporally auto-correlated. Third, the data were bimodal around 
the minimum and maximum values (see Extended Data Fig. 3 for an 
example) and did not conform to any distribution readily available for 
use in generalized additive mixed models (with the mgcv package in R). 
For simplicity, we took a mean across the entire trial for each fish (for 
choice flume and activity data; see below), which resulted in data being 
normally distributed and without auto-correlated repeated measure- 
ments, allowing us to use general linear models (see Supplementary 
Information for additional details). 


Response to predator chemical cues. General linear models were 
used to test for the effects of CO, treatment (present-day versus 
end-of-century levels) and fish size (standard length in mm) onthe 
percentage of time that fish spent on the side of the flume that con- 
tained the predator cue. Among the six species, there were different 
sample sizes, size ranges and years (or locations, for details see Sup- 
plementary Information) in which the fish were tested. Therefore, 
we built separate models for each species—year combination (n=9 
models). We used backwards model selection, beginning by including 
an interaction between the two fixed effects (treatment, standard 
length): F-tests were used to assess the significance of removal of 
model terms on the Akaike information criterion (AIC) (using the 
‘drop!’ function in R). For model selection, a was set to 0.05. We ac- 
knowledge that these (two-tailed) tests were repeated on multiple 
species and multiple response variables, inflating the potential for 
type-I errors; however see a previous study’. Therefore, in our inter- 
pretations, although we refer to effects with P< 0.05 as ‘significant’, 
we emphasize the strength and size of effects, recognizing that P 
values have limitations and represent a continuum of statistical 
significance. Model assumptions were assessed with q-q plots of 
residuals and by plotting residuals against fitted values and against 
each of our predictor variables”. 


Bootstrapping. Most previous studies have used more rapid assess- 
ments of cue preferences than in the present study, in which 4 min 
of measurements have been taken during 9-11 min trials (typically a 
2-min post-handling settling period, 2 min measurement, 3 min for 
side switch and post-switch settling, 2 min measurement)**”"°*>*”, For 
direct comparisons with these studies in our bootstrapping simulations 
(see Supplementary Information), we averaged 2 min of data after a 
2-min post-handling settling period and 2 min of data 3 min after the 
cue side switch (2014 and 2015), or we averaged 2 min of data 2 min after 
the predator cue was first introduced to the choice flume and 2 min of 
data 3 min after the cue side switch (2016). The bootstrapping results 
are presented in Fig. 3, with comparisons to seven papers*>* 16.25.2732, 
Note that another study”—which is also included in Fig. 3—is included 
for comparative purposes. The extremely high variance in one paper” 
(Fig. 3f) was caused by an exceedingly high proportion of control in- 
dividuals reported to have spent 0% of their time in the conspecific 
chemical alarm cue (grey solid bars in Extended Data Fig. 4a) and an 
equally high proportion of high CO, individuals reported to have spent 
100% of their time in the cue (blue solid bars in Extended Data Fig. 4b). 
Additionally, control and high CO, data were pooled to calculate the 
associated variance around the group means for each of the sample size 
scenarios (Fig. 3d-f), similar to a previously published method*.. For ad- 
ditional details on the bootstrapping, see Supplementary Information. 


Activity levels. Time spent active (s) was calculated on a minute- 
by-minute basis (to give s min”). However, data were analysed as one 
value (mean of the trial for each fish) per individual, using the same 
general linear modelling procedures outlined above for ‘Response to 
predator chemical cues’. See Supplementary Information for further 
details. 


Behavioural lateralization. Data collected from each location and year 
were analysed separately owing to the differences in time of year, species 
used and exposure duration. Testing for lateralization is not straightfor- 
ward because it involves multiple binomial experiments with structure; 
a description of issues with the statistical approaches used by previous 
studies to assess lateralization can be found elsewhere™. A test for detect- 
ing lateralization at the population level requires examining the mean 
lateralization score across all individuals inthe sample as population-level 
lateralizationis present whena group of individuals collectively exhibits 
aside bias. By contrast, atest for detecting individual-level lateralization 
requires examining the sample variance as individual-level lateralization is 
present when more individuals exhibit a side bias than expected by chance 
(irrespective of whether it is to the left or to the right). Explanations and 
examples of these two concepts have been published previously***?™. 
Wetested population-level lateralization with a generalized linear mixed 
model (with glmer functionin R) that sets the intercept equal to the grand 
mean of the data”. We tested individual-level lateralization witha x’ test 
comparing the observed variance (numerator) to the expected variance 
(denominator) assuming a normal approximation to the binomial distri- 
bution”. This is analogous to testing for overdispersion (thatis, are there 
more observations in the tail ends of the distribution than expected by 
chance). See Supplementary Information for further details. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The data necessary to reproduce figures and results in this study are 
publicly archived in Figshare following best-practice guidelines», and 
were made available to editors and reviewers at the time of submission: 
https://doi.org/10.6084/m9.figshare.7871522. We place no restrictions 
on data availability. 
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code availability. 
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Extended Data Fig. 1| Interactions between CO, treatment and size for high CO,) and size (xaxis), with shaded areas indicating 95% confidence 

fish activity. a—c, Raw data points and fitted model estimates for activity in intervals of model estimates. Model parameter estimates are included in 

D. aruanus in 2014 (a), A. polyacanthus in 2014 (b) and A. polyacanthus in 2015 (c) Extended Data Table 3. a, n=23 per treatment. b,n=8 per treatment.c, Control, 
asa function of acclimation treatment (grey diamonds, control; blue circles, n=28;CO,,n=38.Sample sizes represent biologically independent animals. 
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Extended Data Fig. 2 | Widespread resilience of behavioural lateralization in 
coral reef damselfishes when faced with end-of-century levels of CO,. 

a-g, Number of right turns (out of 10) under control (closed grey bars) and high 
CO, (open blue bars) conditions for fishes facing either a centred barrier at one 
end of the T-maze (a-f) or an offset barrier at the other end of the T-maze (g). 
Sample sizes represent biologically independent animals). a, P. moluccensis. 
Control, n=29;CO,,n=20.b, C. atripectoralis. Control, n=26;CO,,n=17.c, 
D.aruanus. Control, n=19; CO,,n=21.d, P. amboinensis. Control, n=21; CO,, 


01234567 8 910 
Right turns 


A. polyacanthus 


P. amboinensis 


P. amboinensis 
(retested) 
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(offset barrier) 


di 
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n=22.e,P.amboinensis retested. Control, n=15;CO,,n=15.f, A. polyacanthus. 
Control, n=120; CO,,n=104. g,A. polyacanthus (same sample sizes as inf). 
a-e, Data were obtained at the LIRS in 2014. f, g, Data were obtained at the AIMS 
in 2015. Dashed lines represent the mean number of right turns for each 
treatment group. A tick mark on the panel (coloured according to treatment) 
indicates significant individual-level lateralization, whereas an asterisk at the 
top of the panel indicates significant population-level lateralization. See 
Extended Data Tables 4, 5 for statistics. 
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Extended Data Fig. 3 | Histogram of the percentage of time in predator cue data for fish used in choice flume trials at LIRS in 2016. Each data point included in 
this summary represents analysis of one minute of behavioural data for a fish; the plot contains many repeated measurements for each fish. 


Welch et al. 2014 


a mean 


Frequency (%) 


Frequency (%) 


Control 


Present study 


mean 


Present study Welch et al. 2014 


mean mean — 


Time in cue (%) 


Extended Data Fig. 4| Histogram ofrepresentativedataforpercentageoftime A. polyacanthusinhigh CO, water from parents acclimated to high CO, water 


spentin water containing predator cue or conspecific alarm cue. Histograms 


(b,n=62). Also presented are data (4-min means) from the present study 


of representative data (4-min means) froma previous study” (solidbars)showing — (6species, openbars;n=247 control, n=239 high CO,) showing peak frequencies 


the disproportionate number of fish that were reported to spend 0% of timein 
conspecific chemical alarm cue when acclimated to control water (a) 

or 100% of time in the cue when acclimated to water with elevated CO, levels (b). 
The representative treatment groups” arejuvenile A. polyacanthus in control 
water from parents acclimated to high CO, water (a, n= 62) and juvenile 


around 50% of time in predator cue for both control (a) and high-CO,-exposed 
(b) fish. Sample sizes represent biologically independent animals. Mean values 
for each of the datasets are indicated with vertical lines, and arrows are directed 
at modal values in each of the datasets. 
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Extended Data Table 1| Water chemistry data for the two sites for the three years of the study 


Location/year Treatment pCO2 pCOz Temperature Temperature Alkalinity Alkalinity PHtotal 
(uatm) n (°C) n (umol kg") n (calc.) 
LIRS 2014 Control 406 (21) 19 23.1 (0.9) 2,048 2,292 (2.9) 10 8.04 
High CO2 945 (117) 66 23.1 (0.9) 2,048 2,298 (4.7) 10 7.72 
AIMS 2015 Control 428 (13) 1,907 25.8 (1.0) 5,747 2,356 (5.8) 5 8.03 
High CO2 1,021 (156) 11,537 25.8 (1.0) 3,831 2,355 (5.5) 5 7.70 
LIRS 2016 Control 521 (93) 8 29.4 (0.8) 147 - - - 
High CO2 1,089 (326) 234 29.6 (0.8) 388 - - - 


Data were obtained from the LIRS and AIMS between 2014 and 2016. pco, was measured every 1-4 days, temperature was logged using iButton data-loggers (1 sample per 30 min), and alkalinity 
was measured from 10 samples at LIRS in 2014 and from five samples at AIMS in 2015. Data are presented as mean + s.d. Sample sizes are included in the table. Some of the data for AIMS 
(2015)°“3 and LIRS (2016)°° have previously been published previously. 


Extended Data Table 2 | Model predictions of time spent in predator chemical cue 


Parameter name Parameter estimate (+ SE) t PF 
Humbug, D. aruanus (2014) 

n= 38, model R? = 0.03 

Model intercept 32.36 + 3.59 9.02 <.001 
Acclimation (CO2) 5.67 + 4.94 1.14 0.26 
Humbug, D. aruanus (2016) 

n= 83, model R? = 0.13 


Model intercept 11.95 + 4.07 2.93 0.004 
Acclimation (CO2) 4.23 + 2.47 4 0.09 
SL (mm) 0.44 + 0.13 3.47 <.001 


Spiny chromis, A. po/yacanthus (2015) 

n= 100, model R? = 0.19 

Model intercept 38.57 + 2.29 16.87 <.001 
Acclimation (CO2) 15.69 + 3.23 4.85 <.001 
Spiny chromis, A. po/yacanthus (2016) 

n= 94, model R? = -0.01 

Model intercept 43.58 + 1.93 22.56 <.001 
Acclimation (CO2) -0.54 + 2.96 -0.18 0.86 
Ambon damsel, P. amboinensis (2014) 

n= 38, model R? = -0.02 

Model intercept 31.47 + 3.51 8.98 <.001 
Acclimation (CO2) 1.95 + 4.51 0.43 0.67 
White damsel, D. perspicillatus (2016) 

n= 119, model R? = 0.04 


Model intercept 39.83 + 1.73 22.98 <.001 
Acclimation (CO2) 2.26 + 2.16 1.05 0.30 
Size class (large) -5.19 + 2.19 -2.37 0.02 


Lemon damsel, P. moluccensis (2014) 

n= 48, model R? = 0.01 

Model intercept 27.05 + 3.04 8.90 <.001 
Acclimation (CO2) 5.50 + 4.39 1.25 0.22 
Black-axil chromis, C. atripectoralis (2014) 

n= 40, model R? = -0.03 

Model intercept 30.58 + 4.99 6.13 <.001 
Acclimation (CO2) -0.03 + 6.31 -0.005 0.996 


Parameters (+ standard error) and their statistical significance for general linear models that predict the mean percentage of time spent on the side of the choice flume containing the predator 
chemical cue for an individual fish for six species of coral reef fishes tested in this study. We used backwards model selection using F-tests to compare the AIC of models with and without each 
predictor variable using the ‘drop1’ function in R. Only the parameter estimates for the final (best) models are given, although we always kept the main effect of acclimation treatment (that is, 
the effect of acclimation to elevated CO,) in place because it was the key variable of interest. Note that for the D. perspicillatus model, the baseline factor level for size class was ‘early-stage 
juveniles’ (<15 mm standard length). Sample sizes represent biologically independent animals. Statistical significance is indicated in bold (a = 0.05). 
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Extended Data Table 3 | Model predictions of mean activity levels 


Parameter name Parameter estimate (+ SE) t P 

D. aruanus (2014), n = 46, model R? = 0.31 

Model intercept 25.19 + 7.05 3.57 <.001 
Acclimation (CO2) 31.83 + 9.91 3.21 0.003 
SL (mm) -0.08 + 0.19 -0.44 0.66 
COz2 x SL (mm) -0.59 + 0.26 2.23 0.03 
D. aruanus (2016), n = 94, model R? = 0.00 

Model intercept 41.14 + 1.37 30.78 <.001 
Acclimation (CO2) 0.96 + 1.96 0.49 0.63 
A. polyacanthus (2014), n = 16, model R? = 0.71 

Model intercept 48.11 + 7.81 6.16 <.001 
Acclimation (CO2) 9.66 + 3.12 3.10 0.009 
SL (mm) -0.59 + 0.11 -5.28 <.001 
A. polyacanthus (2015), n = 66, model R? = 0.19 

Model intercept 23.33 + 9.60 2.43 0.01 
Acclimation (CO2) 19.76 + 11.81 1.67 0.10 
SL (mm) 0.30 + 0.79 0.38 0.71 
COz2 x SL (mm) -2.11 + 0.99 -2.13 0.04 
A. polyacanthus (2016), n = 112, model R? = 0.34 

Model intercept 38.24 + 1.53 24.98 <.001 
Acclimation (CO2) 0.34 + 1.77 0.19 0.85 
SL (mm) -0.32 + 0.04 -7.64 <.001 
P. amboinensis (2014), n = 43, model R? = -0.02 

Model intercept 17.34 + 2.58 6.71 <.001 
Acclimation (CO2) 1.05 + 3.61 0.29 0.77 
D. perspicillatus (2016), n = 122, model R? = 0.21 

Model intercept 35.47 + 1.96 18.10 <.001 
Acclimation (CO2) -1.93 + 1.69 -1.15 0.25 
SL (mm) -0.64 + 0.11 -5.71 <.001 
P. moluccensis (2014), n = 49, model R? = 0.05 

Model intercept 19.80 + 2.15 9.21 <.001 
Acclimation (CO2) 6.44 + 3.45 1.81 0.08 
C. atripectoralis (2014), n = 34, model R? = -0.03 

Model intercept 26.10 + 2.22 11.77 <.001 
Acclimation (CO2) -1.32 + 3.46 -0.38 0.71 


Parameters and their statistical significance for general linear models that predict the individual mean activity levels (swimming (s min”)) for six species of coral reef fishes tested in this study. 
We used backwards model selection using F-tests to compare the AIC of models with and without each predictor variable using the ‘drop?’ function in R. Only the parameter estimates for 

the final (best) models are given, although we always kept the main effect of acclimation treatment in place because it was the key variable of interest. Sample sizes represent biologically 
independent animals. Statistical significance is indicated in bold (a = 0.05). 


Extended Data Table 4 | Individual- and population-level lateralization of coral reef fishes 


Treatment Species n ind x? indP  X popz  popP 
Control P. amboinensis 21 36.38 0.014 538 0.83 0.404 
C. atripectoralis 26 =. 37.18 0.055 5.04 0.11 0.912 
D. aruanus 19 9.52 0.947 5.11 0.29 0.772 
P. moluccensis 29 26.89 0.524 552 1.76 0.079 
P. amboinensis (retest) 15 22.56 0.068 4.20 -1.59 0.112 
High CO2 P. amboinensis 22 =«=7.51 0.997 5.31 0.943 0.346 
C. atripectoralis 17 14.82 0.538 5.76 1.99 0.047 
D. aruanus 21 10.78 0.952 495 -0.138 0.890 
P. moluccensis 20 18.08 0.517 5.20 0.566 0.572 
P. amboinensis (retest) 15 16.96 0.258 5.20 0.463 0.644 


Individual- and population-level lateralization was tested for four species of coral reef fishes (P. amboinensis, C. atripectoralis, D. aruanus and P. moluccensis) in a detour test (LIRS 2014) under 
control (around 400 patm CO,) and high CO, (about 1,000 patm) conditions. The sample size (n) and mean number of right turns (X ) out of a total of 10 turns is indicated for each species and 
treatment group (sample sizes represent biologically independent animals). x’ statistics (ind x’) and P values (ind P) are presented for tests of individual-level lateralization. z values (pop z) 
and P values (pop P) are presented for tests of population-level lateralization. P < 0.05 indicates lateralization. ‘retest’ indicates a subset of individuals that underwent a second trial in an 

effort to validate the findings from their first trial. Statistical significance is indicated in bold (a = 0.05). 
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Extended Data Table 5 | Individual- and population-level lateralization of wild and captive-reared A. polyacanthus 


Treatment Species n ind x? ind P X pop z pop P 
A. polyacanthus 
Control (cantered end) 120 219.18  6.28E-08 5.18 0.974 0.33 
A. polyacanthus 
(offset end) 120 170.79 0.001 3.59 -7.76 8.24E-15 
A. polyacanthus 
High CO2 (cantered end) 104 130.58 0.034 4.73 -1.56 0.119 
A. polyacanthus 
offset end 104 152.33 _—0.001 4.14 -4.51 6.56E-06 


The individual- and population-level lateralization of wild (n = 96) and captive-reared (n = 128) A. polyacanthus (mean + s.d. standard length, 20 + 7 mm) was tested in a detour test (AIMS 2015) 
under control (around 400 patm CO,; n= 54 wild, n= 66 captive-reared) and high CO, (about 1,000 patm; n = 42 wild, n = 62 captive-reared) conditions. ‘offset end’ indicates the end of the 
lateralization arena where the barrier was offset by 5 mm to create a situation in which the path around the barrier was shorter if the fish turned left rather than right. The sample size (n) and 
mean number of right turns (X ) out of a total of 10 turns (per arena end) is indicated for each treatment group (sample sizes represent biologically independent animals). x’ statistics (ind x”) and 
P values (ind P) are presented for tests of individual-level lateralization. z values (pop z) and P values (pop P) are presented for tests of population-level lateralization. P < 0.05 indicates 
lateralization. Statistical significance is indicated in bold (a = 0.05). 
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A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
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Software and code 


Policy information about availability of computer code 


Data collection To extract fish behavioural data from videos we recorded of fish in behavioural arenas, we used the commercial software ViewPoint 
(version 3, 20, 5, 83) (details given in the Supplementary Information). 


Data analysis For data analyses, we used a combination of Microsoft Excel (for organizing and summarizing outputs from the ViewPoint video tracking 
software) and R (versions 3.5.0 and 3.5.2, for doing statistical analyses and making most of the figures). Within R, we specifically used 
glmer which is a function of the analysis package Ime4 (version 1.1-20). For data exploration we used mgcv (version 1.8-26). Further 
details are given in the statistical analyses section of the Methods and Supplementary Information. 
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- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The data necessary to reproduce figures and results in this study are publicly archived in the repository figshare and were made available to editors and reviewers at 
the time of submission: https://figshare.com/s/99f6b8973bfd0bb234e8. We place no restrictions on data availability. 
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Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 


Reproducibility 


Randomization 


Blinding 


We tested the effect of elevated CO2 on predator cue avoidance, swimming activity and behavioural lateralisation. Experiments were 
conducted across three years (August-September 2014, April-July 2015, January 2016), at two locations, and on a total of >900 
individuals from six species. Predator cue avoidance and activity trials were conducted at the Lizard Island Research Station (LIRS) in 
2014 and 2016, and at the Australian Institute of Marine Science (AIMS) in Townsville in 2015. Behavioural lateralisation trials were 
conducted at LIRS in 2014 and at AIMS in 2015. 


We used General Linear Models to test for the effect of elevated CO2 on predator cue avoidance. We built separate models for each 
species x year combination due to the many differences between years. Fish size was included as a fixed effect. Swimming activity 
was analysed using the same general linear modelling procedures as for predator cue avoidance. Population-level lateralisation was 
analysed with a generalized linear random-effects model, and individual-level lateralisation was analysed with a chi-square test 
comparing the observed variance (numerator) to the expected variance (denominator) assuming a normal approximation to the 
binomial distribution. All statistical procedures are described in detail in the Methods section and in the Supplementary Information. 


We used six species of coral reef fish (humbug dascyllus Dascyllus aruanus, ambon damsel Pomacentrus amboinensis, lemon damsel 
Pomacentrus moluccensis, black-axil chromis Chromis atripectoralis, spiny chromis Acanthochromis polyacanthus, white damsel 
Dischistodus perspicillatus) and two species of predators (bluespotted rock cod Cephalopholis cyanostigma and flagtail grouper 
Cephalopholis urodeta). Species were selected based on previously used coral reef fishes (for direct replication) and two species not 
previously used in ocean acidification research (for conceptual replication). Fish were collected from the wild at Lizard Island, 
obtained from a public aquarium (Reef HQ Aquarium, Townsville) and obtained from the northern Great Barrier Reef (by Cairns 
Marine Pty Ltd). Fish were juveniles, sub-adults and adults. No fish were sexed. Fish were exposed to CO2, no additional 
manipulations were applied. 


Sample sizes were based on previous studies and fish availability. Previous studies have typically used 10 fish or less per treatment 
group, which formed the lowest sample size used in this study. When possible, we used a higher sample size (e.g., n = 50) to increase 
statistical power but without depleting any wild fish populations. 


For predator cue avoidance and activity, trials were video recorded and analysed using tracking software. Data from lateralisation 
trials was collected manually in real-time (video recorded at LIRS in 2014, data collected by JS, for AIMS 2015, data was collected by 
GDR, NS and AY). 


Timing: For LIRS 2014, data was collected in August and September (Aug 30-Sept 10, data was collected every day). For AIMS 2015, 
predator cue avoidance trials were conducted on May 21, 22, 27, and 27; activity trials were conducted on May 24, 26, 29, 30, 31, 
and June 1 and 2-9; behavioural lateralization trials were done on April 15-17, April 23, 27, 28, and May 17, 19, 23, and 24. Temporal 
gaps in data collection at AIMS was related to logistical factors, including days off, and availability of experimental equipment and 
personnel. For LIRS 2016, data was collected in January (16-31, data collection every day except for Jan 24 which was a day off). 
Rationale for data collection was to perform the maximum number of experiments per day after the fish had been exposed to CO2 
for an adequate amount of time. 


Spatial scale: For LIRS 2014 and 2016, the experiments were performed at the Lizard Island Research Station, Great Barrier Reef, 
Australia. The fish were collected from around Lizard Island at the northern end of the Great Barrier Reef, Australia (14°40’ S; 145° 
28’ E). For AIMS 2015, the experiments were performed at the Australian Institute of Marine Science, Townsville, Australia. Fish were 
obtained from the Reef HQ Aquarium in Townsville. Additionally, groups of wild fish were collected at Davies Reef (18.8238° S, 
147.6429° E) Great Barrier Reef, Australia (in April 2015) and transported to AIMS. 


Data where excluded in instances where the automated tracking software we used failed, causing erroneous tracks of animal 
movement. 


Activity and predator cue avoidance was replicated across three years. Lateralisation was replicated across two years. 


Fish were placed in the two treatment groups (control and elevated CO2) at random. Other aspects of water chemistry (i.e., the 
water supply used and the temperature it was kept at), lighting, and feeding were kept constant among replicate tanks across the 
two acclimation treatments. 


Complete blinding was not possible since the CO2 dosing system was visible (both visually and auditory) to any observer physically 
present for the experiments, making it obvious which replicate tanks contained the elevated CO2 treatment. However, all activity and 
predator cue avoidance experiments were video recorded and analysed using automated tracking software rather than being 
manually scored by a human observer. 


Did the study involve field work? Yes xX No 
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We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 
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Clinical data 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals The study did not involve laboratory animals 


Wild animals Lizard Island Research Station 2014 
Sub-adult and adult wild fishes were collected in August 2014 from around Lizard Island at the northern end of the Great Barrier 
Reef, Australia (14°40’ S; 145° 28’ E), on SCUBA using hand- and/or barrier-nets and spray bottles of clove oil anaesthetic (mixed 
1:4 with ethanol). Five species of damselfishes were collected: humbug dascyllus (Dascyllus aruanus), ambon damsel 
(Pomacentrus amboinensis), lemon damsel (Pomacentrus moluccensis), black-axil chromis (Chromis atripectoralis), and spiny 
chromis (Acanthochromis polyacanthus). Predatory bluespotted rock cod (Cephalopholis cyanostigma) were collected using hook 
and line. Fish were not sexed. All fish were transported in aerated seawater to LIRS. At the end of the experiments, fish were 
released at the site of capture. 


Australian Institute of Marine Science 2015 

A. polyacanthus juveniles (~10-15 days post-hatching) were corralled into clear containers by SCUBA divers from four distinct 
schools (four breeding pairs) at depths of 8-10 m at Davies Reef (18.8238° S, 147.6429° E), Lizard Island, in April 2015. Fish were 
transported in aerated seawater to AIMS. Fish were not sexed. All fish were euthanized with an overdose of tricaine 
methanesulfonate (MS-222, ca. 500 mg L-1) at the end of the experiments, or at intermittent times through experiments when 
they were sacrificed to take precise length and weight measurements for another study (Sundin et al., 2019). 


Lizard Island Research Station 2016 

Wild fishes were collected in January 2016 from around Lizard Island, as detailed above for LIRS 2014. Adult predatory 
bluespotted rock cod (C. cyanostigma) were caught using hook-and-line, and three damselfish species were caught using clove 
oil spray and hand- or barrier-nets (subadult and adult humbug dascyllus [D. aruanus]; juvenile, subadult and adult spiny chromis 
A. polyacanthus]; subadult and adult white damsel [Dischistodus perspicillatus]). Additionally, larval white damsels were caught 
near the end of their pelagic phase using established light trapping techniques. Fish were not sexed. At the end of the 
experiments, fish were released at the site of capture. 


Field-collected samples Lizard Island Research Station 2014 

Damselfishes were placed in tanks with flow-through seawater (35 PSU) at ambient temperature (23+10C, actual range), divided 
in approximately even numbers between eight identical tanks (25 L each; 3 L min-1 flow-through). Fish were fed to satiation 1-2 
times per day with a commercial pellet food. C. cyanostigma were divided in even numbers between two identical tanks (200 L 
each; 12 L min-1 flow-through) and fed pieces of sardine (Sardinops sagax) every 2-3 d. Tanks were cleaned every 3-4 days. 


Australian Institute of Marine Science 2015 

Juvenile spiny chromis (A. polyacanthus) were obtained from the Reef HQ Aquarium in Townsville, Australia in May. The fish 
were housed in 25 L tanks with seawater recirculating (~3.5 L min-1), connected to one of four independent 200 L sumps, which 
themselves were continuously flushed with fresh seawater at 4-7 L min-1. Fish were fed ad libitum 1-2 times per day using 
commercial aquaculture pellets crushed to a powder and/or Artemia nauplii. Four wild predatory fish (flagtail grouper; 
Cephalopholis urodeta) were freighted to AIMS and split evenly between two tanks after being caught from the northern Great 
Barrier Reef by Cairns Marine Pty Ltd. C. urodeta were fed freshly killed juvenile A. polyacanthus every 1-2 days. Fish were 
exposed to natural water temperatures for the region (quantified using thermal data-loggers sampling every 30 min; iButton, 
Maxim Integrated, San Jose, CA, USA). Temperature declined seasonally from 26.1+0.2°C during the first week of acclimation 
(May 2015) to 24.8+0.5°C during the final week of experiments (June 2015; Table S1). Salinity was regulated through the AIMS 
SeaSim aquarium facility and remained at 35.8+0.15 PSU. Tanks were cleaned weekly. 


Lizard Island Research Station 2016 

Fishes were placed in tanks with flow-through seawater at ambient temperature (29.5+10C, actual range). The damselfishes 
were divided in approximately even numbers between 22 identical tanks (one species per tank, 7-8 tanks per species; 10-25 L 
each and 1-3 L min-1 flow-through, depending on fish size). Fish were fed to satiation 1-2 times per day with a commercial fish 
flake-saltwater slurry (TetraMin Tropical Flakes, Tetra, Blacksburg, VA). C. cyanostigma were divided in even numbers between 


four identical tanks (60 L each; 3 L min-1 flow-through) and fed sardine pieces and freshly killed adult damselfish every 2-3 d. All 
tanks were provided with pieces of PVC piping to act as shelter for the fish. Tanks were cleaned every 3-4 days. 


Ethics oversight All experiments were conducted in compliance with all relevant ethical regulations under approval from the James Cook 
University Animal Ethics Committee in association with the Australian Institute of Marine Science (permit A1924). 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Pyrethroid-impregnated bed nets have driven considerable reductions in malaria- 
associated morbidity and mortality in Africa since the beginning of the century’. The 
intense selection pressure exerted by bed nets has precipitated widespread and 
escalating resistance to pyrethroids in African Anopheles populations, threatening to 
reverse the gains that been made by malaria control’. Here we show that expression of 
asensory appendage protein (SAP2), which is enriched in the legs, confers pyrethroid 


resistance to Anopheles gambiae. Expression of SAP2 is increased in insecticide- 
resistant populations and is further induced after the mosquito comes into contact 
with pyrethroids. SAP2 silencing fully restores mortality of the mosquitoes, whereas 
SAP2 overexpression results in increased resistance, probably owing to high-affinity 
binding of SAP2 to pyrethroid insecticides. Mining of genome sequence data reveals a 
selective sweep near the SAP2 locus in the mosquito populations of three West African 
countries (Cameroon, Guinea and Burkina Faso) with the observed increase in 
haplotype-associated single-nucleotide polymorphisms mirroring the increasing 
resistance of mosquitoes to pyrethroids reported in Burkina Faso. Our study identifies 
a previously undescribed mechanism of insecticide resistance that is likely to be 
highly relevant to malaria control efforts. 


Anopheline mosquitoes are the only genus that is capable of trans- 
mitting human malaria. Targeting the mosquito has proven to be the 
most effective means of reducing the incidence of malaria, and the 
massive scale-up of insecticide-based interventions, most notably 
long-lasting insecticidal nets (LLINs), has driven the considerable 
reductions in malaria cases in Africa in the twenty-first century’. 
However, worryingly, after many years of progress, gains in malaria 
control are now stalling, with an estimated 219 million cases and 
435,000 malaria-associated deaths across Africa in 2017 prompting 
are-examination of the effectiveness of the primary prevention tools”. 
LLINs have proven so effective in preventing malaria because they not 
only provide personal protection to LLIN users by reducing biting by 
Anopheles mosquitoes that feed at night, but also have acommunity 
effect, whereby contact with the insecticide decreases the likelihood 
that mosquitoes will survive long enough for the development and 
transmission of the Plasmodium parasite’. The scale-up of LLINs has 
exerted intense selection pressure on Anopheles vectors to develop 
resistance to pyrethroids, the insecticide class used to treat all LLINs 
and, as a result, the community effect of LLINs is being eroded‘. To 
prevent catastrophic failure of insecticide-based vector control tools, 
the resistance mechanisms must be identified and targeted. Success- 
ful management of insecticide resistance has been demonstrated 
by trials using pyrethroid nets that contain the synergist piperonyl 
butoxide (PBO)°, a potent inhibitor of metabolic resistance caused 
by cytochrome P450s, one of the most widespread, and hitherto 


most potent, resistance mechanisms®”. By blocking this resistance 
mechanism, PBO-pyrethroid nets restore insecticide susceptibility, 
leading to a reduction in malaria cases in areas in which metabolic 
resistance prevails>. 

Although PBO-pyrethroid nets are now replacing standard pyre- 
throid-only nets in many regions across Africa’, not all pyrethroid- 
resistant populations can be effectively targeted by this synergist®’ as 
additional mechanisms are contributing to the resistance phenotype. 
Recently, using a meta-analysis approach of transcriptomic data from 
pyrethroid-resistant A. gambiae populations from across Africa, mul- 
tiple new resistance mechanisms—including upregulation of putative 
insecticide-binding proteins—have been identified”. By screening these 
transcriptomic data from Burkina Faso and Cote d’Ivoire—areas with 
particularly high pyrethroid resistance and low PBO synergism?""—we 
found that one family of binding proteins, the sensory appendage 
proteins, were particularly highly overexpressed in these populations 
(Extended Data Table 1). 

Sensory appendage proteins are members of the chemosensory 
protein (CSP) family—small soluble proteins that are found only in 
arthropods” and that mediate the transport of hydrophobic com- 
pounds through the sensillum lymph”. Eight CSP genes are present in 
the genome of A. gambiae, seven of which are clustered on chromosome 
3R (Extended Data Fig. 1). Four members of this family have previously 
been expressed in vitro and have been shown to bind aromatic com- 
pounds", Members of the CSP family have been shown to be induced 
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Fig. 1|CSP expression profiles. a, Constitutive expression of CSPs in resistant 
and susceptible strains. mRNA localization in antennae, head, legs, midgut, 
Malpighian tubules, reproductive tissue and the remaining abdominal tissues 
(abdominal carcass) in N’Gousso and Tiassalé mosquitoes for each member of 
the CSP family compared to the whole body. b, Induction of CSPs in Tiassalé 
mosquitoes following pyrethroid exposure. Four CSPs show significant 
induction of mRNA expression at different time points after exposure to the 
pyrethroid insecticide deltamethrin in Tiassalé mosquitoes (results for the 
non-induced CSPs are shown in Extended Data Fig. 3a). c, Tissue-specific 
profile of the induction of CSPsin Tiassalé mosquitoes. Tissue-specific 


by insecticide exposure, for example by avermectin in the silkworm 
Bombyx mori’, pyrethroids in the diamondback moth Putella xylos- 
tella’® and noenicitinoids in the whitefly Bemiscia tabaci". 


CSP expression and localization 

To determine whether elevated CSP expression was associated with 
pyrethroid resistance in A. gambiae, we characterized the expres- 
sion of all eight members of the family in females of a multi-insecti- 
cide-resistant laboratory colony that originated from Céte d'Ivoire 
(Tiassalé mosquitoes). We found that two CSPs, SAP2 (VectorBase 


induction for the 4 significantly induced CSPs in the Tiassalé strain, shown 4-h 
after exposure to deltamethrin, each data point shows the exposed compared 
to unexposed tissues from the same generation (see Extended Data Fig. 3b for 
the remaining CSPs). The quantitative PCR data show mean +s.d. of three 
biological replicates. *P< 0.05; **P< 0.01; ***P< 0.001. Statistical significance 
was calculated by an ANOVA followed by Dunnett’s post hoc test; in cases in 
which normalization was not possible, data were analysed using a Kruskall- 
Wallis test followed by a Dunn’s post hoc test inaandb. Inc, significance was 
calculated using atwo-tailed Student’s t-test; Pvalues are included in 
Supplementary Table 2; n=3 for each replicate. 


ID: AGAPOO8032) and CSP6 (AGAPO01303), had significantly higher 
constitutive expression compared to two separate susceptible con- 
trol mosquito lines (Extended Data Fig. 2). We next determined the 
localization of CSP expression in multiple tissues, including body areas 
that are the primary points of mosquito contact with LLINs: the legs, 
head and antennae; the major detoxification tissues: the Malpighian 
tubules and the midgut; the reproductive tissues; and the remaining 
abdominal carcass (Fig. 1a). Six of the eight CSPs, including both SAP2 
and CSP6, were enriched in the legs of both resistant and suscepti- 
ble Anopheles mosquito populations, two of the CSPs were enriched 
in the head (SAP2 and SAP3 (AGAPO08054)); two in the antennae 
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Fig. 2|SAP2 mediates resistance to pyrethroid insecticides. a, Effect of SAP2 
knockdown on mortality in multi-insecticide-resistant Anopheles populations 
in response toa panel of insecticides (right bar) compared with GFP-injected 
controls (left bar). Tiassalé, deltamethrin (blue; GFP, n=5; SAP2,n=5); 
permethrin (brown; GFP, n= 6; SAP2,n=7); a-cypermethrin (dark grey; GFP, 
n=5;SAP2,n=5); DDT (yellow; GFP, n=4; SAP2,n=3), primiphos-methyl (light 
grey; GFP, n=4;SAP2,n=4), bendiocarb (purpleO; GFP, n=9; SAP2,n=4) and 
Banfora, deltamethrin (grey blue; GFP, n= 4; SAP2,n=5). Indicated nnumbers 
are the total number of females used across all replicates. b, Transgenic 
overexpression of SAP2in susceptible G3 mosquitoes reduces mortality after 


(CSP3 (AGAPOO8055) and CSP5 (AGAPO08058)) and one in the abdo- 
men carcass (CSPI (AGAPO08059)). 

We performed atime course to determine whether any of the eight 
CSPs were induced by exposure to pyrethroids in the resistant strain 
(Fig. 1b and Extended Data Fig. 3a) and found that four of the eight CSPs 
were significantly induced by deltamethrin, the pyrethroid that is most 
widely used in LLINs, including the constitutively overexpressed SAP2 
and CSP6, inaddition to SAP3 and CSP4 (AGAPO08062) (Fig. 1b). Examin- 
ing the tissue specificity of the induction 4 h after exposure showed that 
SAP2, SAP1 (AGAPOO8051) and CSP3 were induced in multiple tissues, 
including the legs, whereas CSP4, CSP6 and CSP1 showed induction in 
a single tissue (Fig. 1c and Extended Data Fig. 3b). 


CSPs confer resistance 

Having established that CSPs are highly overexpressed in the append- 
ages of pyrethroid-resistant mosquitoes, and expression of a subset of 
this family is further induced by insecticide exposure, we next silenced 
these CSPs using RNA interference (RNAi) (Extended Data Fig. 4) in 
females from the highly pyrethroid-resistant Tiassalé colony before 
exposing the mosquitoes to a panel of insecticide classes (pyrethroids, 
carbamates and organophosphates) that are widely used in public 
health initiatives. Notably, silencing of SAP2 almost completely restored 
susceptibility to the pyrethroid deltamethrin, and also significantly 
increased the susceptibility to the other two pyrethroids (permethrin 
and a-cypermethrin) that were used in these analyses (Fig. 2a). No 
change in mortality was observed after exposure to the other insecti- 
cide classes, indicating that this mechanism is specific to pyrethroid 
insecticides. We further explored this notable phenotype ina second 
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permethrin exposure. Bars represent control (grey; n=15) and SAP2 
overexpression (white; n =17). Indicated n numbers are the total number of 
females used across all replicates. c, Competitive binding assays of the three 
SAP proteins to three pyrethroid insecticides. Only instances with binding 
shown; no binding was found for SAP3 and SAP1 with permethrin; SAP3 with 
a-cypermethrin; any SAP with bendiocarb or pirimiphos-methyl. 1-NPN, 
N-phenyl-1-naphthylamine. The data are mean +s.d.*P< 0.05; **P< 0.01; 

***P< 0.001; NS, not significant (P> 0.05). Statistical significance ina and b was 
calculated using an ANOVA test followed by a Tukey post hoc test; Pvalues are 
included in Supplementary Table 2. 


multi-insecticide-resistant population from Burkina Faso (Banfora) and 
again found that mortality of the mosquitoes was significantly restored 
after exposure to deltamethrin (Fig. 2a). Knockdown of the other three 
CSPs (SAP3, CSP4 and CSP6) had no effect on mortality to any insecticide 
class, except in the case of CSP6 knockdown, which also significantly 
increased susceptibility to deltamethrin in Tiassalé mosquitoes— 
although not to the same extent as SAP2 knockdown (Extended Data 
Fig. 5). Conversely, overexpression of SAP2in aninsecticide-susceptible 
population (Extended Data Fig. 6) significantly increased pyrethroid 
resistance, directly linking the function of this protein to insecticide 
resistance (Fig. 2b). To check for adverse changes to the life-history 
traits of mosquitoes owing to the disruption of SAP2 expression, we 
recorded the survival, blood-feeding ability and egg productionin SAP2 
RNAi-injected females and found no significant changes compared to 
aGFP-injected control, indicating that the mortality that is observed is 
due toa direct effect of SAP2 on the insecticide and not reduced overall 
fitness (Extended Data Fig. 7). 

To determine a putative mode of action through which achemosen- 
sory protein could confer resistance, we next investigated whether 
these proteins act as pyrethroid-binding proteins. We heterologously 
expressed SAP2 and two closely related CSPs (SAP1 and SAP3) in Escheri- 
chiacoliand carried out competitive binding assays with the fluorescent 
marker N-phenyl-1-naphthylamine and a panel of insecticides (Fig. 2c). 
SAP2 bound to all three pyrethroid insecticides tested: deltamethrin 
(half-maximum inhibitory concentration (IC;,) =3.99 UM); permethrin 
(IC) =4.77 LM); and a-cypermethrin (IC;, = 3.74 1M), but did not bind 
to pirimiphos-methyl (an organophosphate) or bendiocarb (a carba- 
mate). The IC,, values were similar to those found for the most impor- 
tant cytochrome p450s that are responsible for metabolic clearance 
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Fig. 3|SAP2is upregulated and under selection in multiple countries 


across West Africa. a, Points represent significantly differential expression 


of SAP2 in pyrethroid-resistant mosquitoes in two sister species (A. coluzzii 
and A. gambiae) compared with susceptible populations (supplementary 
table 1 froma previously published study”’). Significant (limma; Benjamini- 
Hochberg-corrected P value”, P< 0.05) fold changes are shown in green, 


orange and red. Countries in which SAP2 is significantly upregulated and/or 
involved in the selective sweep are highlighted with a pin. No transcriptomic 


data are available for Guinea. The map was created specifically for this 
manuscript by M. Bernardi. b, Left, schematic of the range of the selective 
sweep found in Guinea, Burkina Faso and Cameroon with a selective sweep 


found across these regions in the Anopheles gambiae 1000 Genomes Project 


highlighted in grey”. Right, the observed iHS signal from Guinea is shown as 
follows (from top to bottom): raw iHS statistics per SNP, normalized by 


of pyrethroid insecticides®’ (deltamethrin: CYP6M2, IC., = 4.24 1M 
and CYP6P3, IC;, = 3.17 uM; permethrin: CYP6M2, IC;, = 8.07 uM and 
CYP6P3, IC. = 6.77 UM)’. The binding of the pyrethroids to SAP2 


had significantly higher specificity (analysis of variance (ANOVA) 


P<0.0001) than the other SAP proteins; however, both SAP1 and SAP3 
bound weakly to deltamethrin (IC,, > 10 pM) and SAP1 bound moder- 


ately to a-cypermethrin (IC,. = 9.02 LM). 


chromosome in allele frequency bins; summarized iHS in windows of 20 kb 
by proportion (p) of SNPs exceeding 2.5 standard deviations”°; and genes 

in this region with SAP2 annotated. The range of the 55 SNPs with this 
haplotype across the three countries is highlighted in yellow. Bottom right, 
a magnified area shows the approximate location of the five haplotype- 
associated SNPs in proximity to SAP2 (Extended Data Fig. 1a). c, Fitted trend 
in frequency of the derived haplotype-associated SNPs in field populations 
from Burkina Faso. A. coluzzii populations collected from Tengrela from 2011 
to 2018 (left) and A. gambiae s.s. samples collected in 2013 and 2015 from 
Bakaridjan, Burkina Faso and 2018 from Tiefora, Burkina Faso (right). Dates 
when samples were sequenced are indicated by a dotted line. SNP data are 
shown in the legend, including the locus and the alternate allele (left) and 
PEST allele (right). 


Having established the importance of SAP2 expression for the resist- 
ance of mosquitoes to pyrethroids ina laboratory setting, and having 
detected the overexpression of this transcript in available transcrip- 
tomic data from multiple resistant West African field populations 
(Fig. 3a and Extended Data Table 1), we next looked for evidence of 
selection at this locus, using data from the Anopheles gambiae 1000 
Genomes Project” and further de novo sequencing. We computed 
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the selection statistics—integrated Haplotype Score (his)”° and 
XP-EHH (Cross Population Extended Haplotype Homozygosity)”—using 
1,142 wild-caught A. gambiae” for the region in which the CSP cluster 
is found onchromosome 3R (Fig. 3b). Evidence for a selective sweep at 
this locus was observed in A. gambiae from Guinea and Burkina Faso. 
The swept haplotypeis present at alow frequency but there is evidence 
of haplotype sharing between Guinea, Burkina Faso and Cameroon. 
We next identified single-nucleotide polymorphisms (SNPs) that are 
diagnostic of the derived haplotype in this selective sweep (Fig. 3b) and 
sequenced this region in individual mosquitoes that were collected in 
the Banfora region of Burkina Faso from 2011 to 2018. This haplotype 
was stable in frequency in A. gambiae sensu stricto (s.s.), indicating 
that this sweep occurred before the collections of mosquitoes by the 
Anopheles gambiae 1000 Genomes Project” in 2012 and pre-dating 
our collection dates (Fig. 3c and Supplementary Table 1). Conversely, 
Anopheles coluzzii samples collected from 2011 to 2018 showed a large 
increase in frequency of the SNPs associated with the sweep (Fig. 3c and 
Supplementary Table 1), and this corresponded with a sharp increase 
in the prevalence of pyrethroid resistance that occurred during this 
time period (Extended Data Fig. 8). 


Discussion 


Our results show that SAP2, a chemosensory protein with no previous 
known function in insecticide resistance, has a key role in conferring 
pyrethroid resistance inthe A. gambiae species complex through the 
binding of insecticides at the first point of mosquito contact with 
bed nets. When SAP2 was overexpressed in an otherwise-susceptible 
background, the transgenic line was more resistant to pyrethroids, 
but the phenotype was less marked than the phenotypes found inthe 
SAP2 knockdown lines, perhaps indicating that SAP2 acts in conjunc- 
tion with other resistance mechanism(s) to provide an additive effect. 
Given its strong binding to pyrethroid insecticides, it is possible that 
SAP2 acts by sequestering the insecticide directly, thus either prevent- 
ing the function of the insecticide on the nervous system or facilitating 
its detoxification. Notably, longitudinal sequencing of field samples 
and available transcriptomic data from wild collections show that this 
mechanism is being selected for in multiple countries in West Africa, 
highlighting its relevance in field settings. Nationwide distribution of 
LLINs has not affected malaria transmission in Burkina Faso, where 
cases continue to rise annually”. Our finding of a potent mechanism 
that has rapidly swept across anopheline populations in this region 
will help to address the contribution of insecticide resistance to the 
persistence of malaria, and the identification of the corresponding 
SNPs will facilitate tracking its spread. Finally, the identification of 
this previously undescribed insecticide sequestration mechanism 
offers the concrete possibility to restore the effectiveness of pyre- 
throid insecticides in natural mosquito populations through the 
identification of new targets for inhibitors that can be incorporated 
into bed nets, in an analogous manner to the incorporation of PBO 
into nets—this may prove to be critical for the elimination of malaria 
across Africa. 


380 | Nature | Vol577 | 16 January 2020 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 
acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1864-1. 


1. Bhatt, S. et al. The effect of malaria control on Plasmodium falciparum in Africa between 
2000 and 2015. Nature 526, 207-211 (2015). 

2. WHO. World Malaria Report (WHO, 2018). 

3. Hawley, W. A. et al. Community-wide effects of permethrin-treated bed nets on child 
mortality and malaria morbidity in western Kenya. Am. J. Trop. Med. Hyg. 68, 121-127 
(2003). 

4. Churcher, T. S., Lissenden, N., Griffin, J. T., Worrall, E. & Ranson, H. The impact of 
pyrethroid resistance on the efficacy and effectiveness of bednets for malaria control in 
Africa. eLife 5, e16090 (2016). 

5.  Protopopoff, N. et al. Effectiveness of a long-lasting piperonyl butoxide-treated 
insecticidal net and indoor residual spray interventions, separately and together, against 
malaria transmitted by pyrethroid-resistant mosquitoes: a cluster, randomised 
controlled, two-by-two factorial design trial. Lancet 391, 1577-1588 (2018). 

6. | Stevenson, B. J. et al. Cytochrome P450 6M2 from the malaria vector Anopheles gambiae 
metabolizes pyrethroids: sequential metabolism of deltamethrin revealed. Insect 
Biochem. Mol. Biol. 41, 492-502 (2011). 

7. Miller, P. et al. Field-caught permethrin-resistant Anopheles gambiae overexpress 
CYP6P3, a P450 that metabolises pyrethroids. PLoS Genet. 4, e1000286 (2008). 

8. Gleave, K., Lissenden, N., Richardson, M. & Ranson, H. Piperonyl butoxide (PBO) 
combined with pyrethroids in long-lasting insecticidal nets (LLINs) to prevent malaria in 
Africa. Cochrane Database Syst. Rev. 8, CD012776 (2017). 

9. Toe, K. H. et al. Do bednets including piperonyl butoxide offer additional protection 
against populations of Anopheles gambiae s.l. that are highly resistant to pyrethroids? An 
experimental hut evaluation in Burkina Fasov. Med. Vet. Entomol. 32, 407-416 (2018). 

10. Ingham, V. A., Wagstaff, S. & Ranson, H. Transcriptomic meta-signatures identified in 
Anopheles gambiae populations reveal previously undetected insecticide resistance 
mechanisms. Nat. Commun. 9, 5282 (2018). 

1. Edi, C. V. et al. CYP6 P450 enzymes and ACE-1 duplication produce extreme and multiple 
insecticide resistance in the malaria mosquito Anopheles gambiae. PLoS Genet. 10, 
e1004236 (2014). 

12. Vieira, F. G. & Rozas, J. Comparative genomics of the odorant-binding and chemosensory 
protein gene families across the Arthropoda: origin and evolutionary history of the 
chemosensory system. Genome Biol. Evol. 3, 476-490 (2011). 

13. Leal, W. S. Odorant reception in insects: roles of receptors, binding proteins, and 
degrading enzymes. Annu. Rev. Entomol. 58, 373-391 (2013). 

14. lovinella, |., Bozza, F., Caputo, B., Della Torre, A. & Pelosi, P. Ligand-binding study of 
Anopheles gambiae chemosensory proteins. Chem. Senses 38, 409-419 (2013). 

15. Xuan,N. et al. Increased expression of CSP and CYP genes in adult silkworm females 
exposed to avermectins. Insect Sci. 22, 203-219 (2015). 

16. Bautista, M. A. M. et al. Evidence for trade-offs in detoxification and chemosensation gene 
signatures in Plutella xylostella. Pest Manag. Sci. 71, 423-432 (2015). 

17. Liu, G. X. et al. Biotype expression and insecticide response of Bemisia tabaci 
chemosensory protein-1. Arch. Insect Biochem. Physiol. 85, 137-151 (2014). 

18. Yunta, C. et al. Pyriproxyfen is metabolized by P450s associated with pyrethroid 
resistance in An. gambiae. Insect Biochem. Mol. Biol. 78, 50-57 (2016). 

19. _TheAnopheles gambiae 1000 Genomes Consortium. Genetic diversity of the African 
malaria vector Anopheles gambiae. Nature 552, 96-100 (2017). 

20. Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in 
the human genome. PLoS Biol. 4, €72 (2006). 

21. Sabeti, P. C. et al. Genome-wide detection and characterization of positive selection in 
human populations. Nature 449, 913-918 (2007). 

22. Pombi, M. et al. Unexpectedly high Plasmodium sporozoite rate associated with low 
human blood index in Anopheles coluzzii from a LLIN-protected village in Burkina Faso. 
Sci. Rep. 8, 12806 (2018). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Mosquito strains 

The populations, each belonging to the A. gambiae species complex, 
used in these experiments were reared and maintained at the Liverpool 
School of Tropical Medicine, under standard insectary conditions at 
27 °C, 70-80% humidity, under a12:12-h photoperiod. Both the Tiassalé 
(A. gambiae sensu lato (s.I.)) strain from Céte d'Ivoire" and the Banfora 
strain (A. coluzzii) from Burkina Faso” are from insecticide-resistant 
populations and have been maintained under insecticide pressure since 
colonization in 2009 and 2014, respectively. The laboratory-maintained 
insecticide-susceptible populations N’Gousso (A. coluzzii)*, Kisumu (A. 
gambiae s.s.) and G3 (A. gambiae s.|.) originate from Cameroon, Kenya 
and The Gambia, respectively. A. gambiae s.|. here refers to a mixture 
of A. gambiae and A. coluzzii. 


RNAi 

PCR was performed on cDNA from Tiassalé mosquitoes using Phu- 
sion High-Fidelity DNA Polymerase (Thermo Scientific) following the 
manufacturer’s instructions and primer sets with a T7 docking sequence 
at the 5’ end of both the sense and antisense primers. Primers were 
designed using NCBI Primer BLAST to produce an asymmetric product 
witha length of 300-600 bp, aGC content of 20-50% and no more than 
three consecutive equivalent nucleotides (Supplementary Table 3).PCR 
was performed with the following cycles: 3 min at 98 °C; 35 cycles of 7s 
at 98 °C; 7s at calculated T,,; and 10 s at 72 °C; followed by a final hold 
at 72 °C for 7 min. PCR products were purified using a Qiagen QIAquick 
PCR Purification kit following the manufacturer’s instructions. dsRNA 
was synthesized using a Megascript T7 Transcription kit (Ambion), with 
a16-h 37 °C incubation, following the manufacturer’s instructions. 
The dsRNA was cleaned using a MegaClear Transcription Clear Up 
kit (Ambion), with nuclease-free DEPC-treated water, heated twice at 
65 °C for 10 min, to elute the sample. The resultant dsRNA product was 
analysed using a Nanodrop spectrometer (Nanodrop Technologies) 
and subsequently concentrated to 3 pg pl using a vacuum centrifuge 
at 37 °C. Presumed mated, 3-5-day-old, non-blood-fed females were 
immobilized ona CO, block and 69 nl dsRNA was injected directly into 
the thorax, between the cuticle plates of the abdomen, underneath 
the wing. As a control, non-endogenous GFP dsRNA was injected at 
the same amount and concentration. 


UAS:SAP2 plasmid construction 

Owing to the difficulty of the transformation of a UAS:SAP2 construct 
into £. coli with a complete open-reading frame (ORF), a fusion gene 
was created that was interrupted by a synthetic intron directly after 
the start codon. Two fragments were amplified separately and fused 
together through 40-bp overlapping nucleotides (20 final nucleotides 
of the synthetic intron and 20 nucleotides downstream of the SAP2 
start codon). The 5’ fragment containing a start codon, intron and 
SAP2 overlap was amplified from pSL-attB-Gyp-UAS14i-Cyp6P3-eYFP 
(A. Adolfi and G.L., unpublished data). In parallel, SAP2 was ampli- 
fied from gDNA of Tiassalé mosquitoes using the following prim- 
ers: forward, 5’-TTCTGAAT TCCATCATGAAACTGTTCGTCGCC-3’ and 
reverse 5’-T TCTCTCGAGT TAT TCCAGCTTGATG-3’. The 3’ fragment for 
fusion was amplified from the SAP2 PCR product with the overlapping 
sequences indicated above. The primers used were: forward, 5’-TTCTG 
AAT TCCATCATGGTAAGTATCAAGGT TACA-3’, reverse 5’-GCGATGGCGA 
CGAACAGTTTCTGTGGAGAGAAAGGCAAAG-3’ and forward 5’-CTTTGC 
CTTTCTCTCCACAGAAACTGT TCGTCGCCATCGC-3’, reverse 5’-TT 
CTCTCGAGTTATTCCAGCTTGATG, respectively. The two fragments 


were fused together by PCR using the outer amplification primers, 
subcloned into pJET1.2 (Thermo Scientific) and inserted into pSL-attB- 
Gyp-UAS14-eYFP-gyp-attB (G.L., unpublished data), which carries an 
inverse attB site for RCME (recombinase-mediated cassette exchange) 
flanking, the 3xP3-YFP transformation marker and multiple cloning 
site through the EcoRI/Xhol restrictions sites, to generate the plasmid 
pUAS:SAP2. Sequence analysis indicated a point mutation within the 
synthetic intron but outside of the splice and acceptor donor sites. 


Transgenic lines 

The CFP-marked A10 Ubi-GAL4 line” was used as the RCME docking 
line to establish the UAS:SAP2 line. The A10 promoter, polyubiquitin 
c(PUBc), has previously been characterized, and shows high levels of 
expression throughout adult male and female mosquitoes”. mCherry 
under the PUBc promoter in the A10 line has fluorescence in the thorax, 
non-sclerotized abdomen and appendages (wing veins, antennae, 
palps, proboscis/labium and legs)” (Extended Data Fig. 6b). Microinjec- 
tions were performed on approximately 200 embryos with 350 ng pI 
of UAS:SAP2 responder plasmid and 150 ng pl" of phiC31 integrase 
helper plasmid pKC40 (gift from L. Ringrose). Subsequently, 38 Fy 
L1 larvae that transiently expressed the YFP marker were obtained, 
reared separately and crossed into G3 mosquitoes of the opposite 
sex. Three YFP-positive F, male adults were identified from screening, 
which were backcrossed with wild-type G3 mosquitoes, and character- 
ized for orientation of the exchange cassette through a diagnostic 
PCR. All males showed identical orientation in the expected genomic 
site. From these F, crosses, a single isofemale line was selected, which 
showed the expected 50% YFP inheritance in F,. This line was further 
characterized after crossing with the parental A10 Ubi-Gal4 driver line 
for correct splicing of the inserted intron (Extended Data Fig. 6c) and 
for insecticide bioassays. 


Bioassays 

At 72-h after injection, 3—5-day-old adult females were exposed toa 
panel of insecticides using standard insecticide impregnated papers 
in WHO tubes”*. In each case, 15-30 adult females were exposed to 
insecticide for 1h and then left in a control tube for 24 hin insectary 
conditions before mortality was scored (minimum biological repli- 
cates, n=3; maximum, n=17). For each experiment, mosquitoes were 
simultaneously exposed to untreated control papers. GFP-injected 
mosquitoes were used as controls for the RNAi bioassays and the GAL4- 
driver line acted asa control for the bioassays on the transgenic strains. 
For the induction experiments, no 24-h recovery period was allowed, 
and mosquitoes were taken at the time point directly after 1-h expo- 
sure; this experiment was only possible on the resistant population, 
owing to mortality in the susceptible control. Bartlett and Shapiro tests 
were used to confirm homogeneity of variance and normality of data, 
respectively. For non-normal data, transformations were performed 
to achieve normality. Analysis of mortality data was done using an 
ANOVA followed by a Tukey post hoc test. Graphs were produced using 
GraphPad Prism 8.0.2. Mortalities and significance levels for figure 
panels are shown in Supplementary Table 2. 


RNA extraction 

RNA was extracted and purified in each experimental set using an 
Arcturus PicoPure RNA Isolation Kit (Thermo Fisher Scientific) fol- 
lowing the manufacturer’s instructions. Whole-body resistant and 
susceptible experiments, and RNAi-knockdown efficiency checks, 
used 5-7, 3-5-day-old adult female mosquitoes. For insecticide induc- 
tion, 3-5-day-old female mosquitoes were snap-frozen pre-exposure 
and at 30 min, 1h, 2h, 4h, 24 hand 48 h after deltamethrin exposure 
and RNA was extracted as above. For tissue experiments, 50-100 
3-5-day-old females had heads, legs, antennae, midgut, Malpighian 
tubules, reproductive tissues (and hindgut), along with the remaining 
abdominal carcass in which fat body cells predominate”, removed 
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with microdissection scissors and tweezers on iced PBS, alongside 
5-7 age-matched whole females from the same cage. The quality of the 
RNA was assessed using a Nanodrop spectrophotometer (Nanodrop 
Technologies). 


Quantitative PCR 

RNA (1-4 pg) from each biological replicate was reverse-transcribed 
using oligodT (Invitrogen) and Superscript III (Invitrogen) according 
to the manufacturer’s instructions. Quantitative PCR (qPCR) was per- 
formed using SYBR Green Supermix III (Applied Biosystems) using an 
MX3005 and the associated MxPro software v.4.10 (Agilent). Primer 
BLAST (NCBI) was used to design primer pairs. Where possible, prim- 
ers were designed to span an exon junction (Supplementary Table 3). 
Each 20-ul reaction contained 10 pl SYBR Green Supermix, 0.3 LM of 
each primer and 1 pl of 2ng pl cDNA. Standard curves for each primer 
set were used to calculate efficiency, using five 1:5 dilutions of cDNA to 
ensure that all primer sets met the MIQE guidelines (90-120%). qPCR 
was performed with the following conditions: 3 min at 95 °C, with 
40 cycles of 10 s at 95 °C, followed by 10 s at 60 °C. Relative expres- 
sion was normalized to two housekeeping genes: EF (AGAPO05128) 
and $7(AGAP010592). Analysis was performed on AC, values”*; Bartlett 
and Shapiro tests were used to confirm homogeneity of variance and 
normality of data, respectively. For non-normal data, transformations 
were performed where possible to achieve normality. Normal data 
were analysed using an ANOVA followed by a Dunnett’s post hoc test, 
non-normal data were analysed using a Kruskall-Wallis test followed 
by aDunn’s post hoc test. Graphs were produced using GraphPad Prism 
8.0.2. All qPCR analysis had three biological replicates and three techni- 
cal replicates within each biological replicate, with the exception of the 
transgenics experiments, which had two biological replicates. Relative 
fold changes and significance levels for each figure panel are shownin 
Supplementary Table 2; primers are shown in Supplementary Table 3 
and SAP2 sequences to demonstrate conservation of the binding site 
are shown in Extended Data Fig. 9. The PCR for sequencing was per- 
formed in two reactions, and sequenced using the primers, region1-F 
5’-CACAACGAT TCGTGGTCACC-3’, region1-R 5’-CACT TCACAACT TGC 
AATGAA-3’, sequenced using region1-R. The second region was similarly 
amplified and sequenced using region2-F 5’-TTCATTGCAAGTTGTGAAG 
TG-3’, region2-R 5’-GCACCAGCTGATCGTTGA-3’, and sequenced using 
region2-F. PCR used Phusion High-Fidelity DNA Polymerase (Thermo 
Fisher Scientific) with a cycle at 98 °C for 30 s followed by 40 cycles 
of 98 °C for 7s, 64 °C for 10 s, 72 °C for 2 min and a final hold at 72 °C 
for 10 min. Gel extraction used the Qiagen Gel Extraction Kit (Qiagen) 
following the manufacturer’s instructions and PCR products were 
Sanger-sequenced at Eurofins Genomics. 


SAP constructs 

SAP1, SAP2 and SAP3 proteins were expressed in bacteria (SAP1 and SAP3 
were expressed as N-terminal His-tagged proteins, and SAP2—whichis 
secreted in the periplasmic space—was expressed as a C-terminal His- 
tagged protein). The bacterial expression constructs used for SAP1 and 
SAP3 expression were generated by directly subcloning the SAPI and 
SAP3 ORFs (as EcoRI-Ndel fragments) from the relevant constructs used 
ina previous study” (provided by A. della Torre) into the pET16bTeV 
vector to generate plasmids pET16TeV.SAP1 and pET16TeV.SAP3, which 
expressed the N-terminal His-tagged versions of SAP1 and SAP3, respec- 
tively. For SAP2 expression, a stepwise cloning strategy into the pET22b 
vector was used, as follows: A. gambiae cDNA (made from RNA from 
mosquito heads of the N’Gousso strain) was used as atemplate for PCR 
amplification of a384-bp SAP2 ORF with Kapa Taq polymerase (Roche) 
using primers SAP2 forward (5’-ATGAAACTGT TCGTCGCCATC-3’) and 
SAP2 reverse (5’-TTAT TCCAGCTTGATGCCCTC-3’) using the following 
conditions: 95 °C for 5 min and 30 cycles of 95 °C for 30 s, 60 °C for 
30s and 72 °C for 30 s, followed by a final extension step at 72 °C for 
5 min. The amplified SAP2 ORF was subcloned into the pGEM-T-Easy 


vector (Promega) and sequence-verified. Plasmid DNA froma positive 
pGEM.SAP2 clone was used as a template for the amplification of the 
mature SAP2 ORF (that is, without the native signal peptide sequence 
to enable cloning in frame with the pET22-encoded bacterial signal 
peptide), using primers NcoSAP2-F 5’-TAACCATGGCCCAGGAGCAGT 
ACACCACC-3’ (to generate an Ncol site in frame with the pelB ORF in 
pET-22b) and XhoSAP2A-R 5’-GTCTCGAGT TCCAGCTTGATGCCCTC 
CTT-3’ (to remove the termination codon, thus enabling C-terminal 
His tagging, and also to introduce an Xhol site). The amplified prod- 
uct was digested with Ncol and Xhol and subcloned into the relevant 
cloning sites of pET-22b. The recombinant plasmid pET22.SAP2His 
was sequence-verified and used for periplasmic protein expression. 


Protein expression and purification 

Expression of SAPI and SAP3 was carried out in BL21-Codon Plus (DE3) 
E. colicells (Agilent) containing pET16TeV.SAP1 and pET16TeVSAP3, 
respectively. A single colony was selected and cultured overnight at 
37 °C in 80 ml of terrific broth containing 100 pg ml ampicillin and 
34 pg mI chloraphenicol. The 80-ml overnight bacterial culture was 
used to inoculate a 4-1 culture of terrific broth containing 100 pg mI 
ampicillin and 34 pg ml chloramphenicol. Cells were cultured at 
37 °C until an optical density at 595 nm (OD,,;) of 0.8. Recombinant 
protein expression was induced by addition of 0.4 mM isopropyl-B-D- 
thiogalactoside (IPTG) and the bacteria were cultured for an additional 
4hat 30 °C. Bacteria were collected by centrifugation (5,000 rpm for 
30 min) and the pellet was resuspended in 1 M NaCl, 50 mM Tris-HCI 
pH 8.0, 0.2 mM EDTA, 0.2 mM PMSF buffer and sonicated for 30 min 
at 4 °C. Following sonication, samples were centrifuged by high-speed 
centrifugation at 20,000g for 30 min at 4 °C, and the supernatant was 
collected and processed ina purification step. 

The supernatant containing soluble SAP1 or SAP3 was purified by Ni?*- 
affinity chromatography as follows: a5.5-ml Ni-NTA agarose column was 
equilibrated with a buffer containing 1 M NaCl, 50 mM Tris-HCl pH 8.0 
and 5 mM imidazole. The supernatant was loaded onto the column 
and washed with equilibration buffer for 10 column volumes. Another 
wash of 10 column volumes was performed with 25 mM NaCl, 50 mM 
Tris-HCI pH 8.0, 10% glycerol and 10 mM imidazole. Bound protein was 
eluted with 5 column volumes of 50 mM Tris pH 8.0, 25 mM NaCl, 10% 
glycerol and 300 mM imidazole. Fractions with an absorbance at 595 
nm (Bradford assay) greater than 3 mg mI‘ were pooled, concentrated 
and buffer-exchanged into 50 mM Tris-HCI pH 8.0, 25 mM NaCl and 10% 
glycerol with dialysis membrane (6-8 kDa) overnight at 4 °C. 

For expression of SAP2, F. coliBL21-Codon Plus (DE3) cells trans- 
formed with pET22.SAP2His were cultured overnight at 37 °C in 20 ml 
terrific broth containing 100 pg mI ampicillin and 34 pg mI chloram- 
phenicol. The 20-ml overnight £. coli culture was used to inoculate a 2-1 
culture of terrific broth containing 100 pg ml“ ampicillin and 34 pg mI 
chloramphenicol. Cells were cultured at 37 °C until an OD,,,; of 0.8. Pro- 
tein expression was induced by addition of 0.1 mMIPTG and the bacteria 
were cultured for an additional 16-18 hat 25 °C. Cells were collected by 
centrifugation (5,000 rpm for 30 min) and the periplasmic E. colifrac- 
tion was extracted by osmotic shock as previously described”. In brief, 
collected cells were suspended ina hypertonic solution of 30 mM Tris, 
20% w/v sucrose, 1mMMEDTA, pH 8.0 (25 ml per 1-1 culture) and incubated 
for 30 min at 4 °C. Then, cells were centrifuged at 20,000g at 4 °C for 
20 min and the supernatant was collected. Cells were resuspended in 
ahypotonic solution of 5mM MgSO, (25 ml per 1-I culture), incubated 
for 30 minat4 °C, followed by an additional centrifugation at 20,000g 
at 4 °C for 20 min, and the supernatant was collected. The supernatant 
from the hypotonic solution was used for the purification step. 

The periplasmic solution containing secreted SAP2 was purified 
by Ni?*-affinity chromatography as follows. A 6.5-ml Ni-NTA agarose 
column was equilibrated with 20 mM Tris, 300 mM NaCl, 40 mM imi- 
dazole, pH 8.0. The clarified osmotic shock fluid (supernatant from 
the hypotonic solution) was loaded onto the column and washed with 


equilibration buffer for 15 column volumes. Bound protein was eluted 
with 5 column volumes of 20 mM Tris, 300 mM NaCl, 500 mM imida- 
zole, pH 8.0. Fractions with an absorbance at 59 nm (Bradford assay) 
greater than 0.5 were pooled, concentrated and buffer-exchanged 
into 50mM Tris-HCI pH 8.0, 25mM NaCl, 10% glycerol with dialysis 
membrane (6-8kDa MWCO) overnight at 4 °C. 


1-NPN binding competition assays 

The affinity of various insecticides for SAP1, SAP2 and SAP3 was meas- 
ured indirectly by competitive binding assays, which determined the 
displacement of the ligands from the SAPs by the fluorescent probe 
1-NPN. Purified recombinant SAP proteins were mixed with different 
concentrations of insecticides (in methanol) and 1-NPN (in methanol) 
in a final volume of 200 ul of 20 mM Tris-HCI, pH 8.0, 100 mM NaCl 
(containing 1.5% methanol). The final concentration of recombinant 
SAP protein and 1-NPN in the assays was 10 uM and 5 uM, respectively. 
Insecticide concentration ranged from 0.0625 to 20 uM. The probe 
was excited at 337 nm and emission spectra were recorded between 
380 nm and 460 nm (peak emission in the presence of recombinant 
SAP is at 386-388 nm). Emission spectra were recorded on an Infinite 
M-200 fluorimeter (Tecan Trading) using black 96-well plates (Greiner 
Bio-One). Graphs were produced using GraphPad Prism 8.0.2. A non- 
linear fit using logged concentrations were used to fit best-fit lines and 
calculate the IC, values. The calculated log(IC;,) values and associated 
standard errors were used to fit an ANOVA to compare the binding 
specificity of the three proteins. 


Microarray data 
Microarray data were retrieved from the IR-TEx application”? (https:// 
www.|stmed.ac.uk/projects/ir-tex). 


Field collections 

Field collections of A. gambiae s.|. were performed by colleagues at 
the Liverpool School of Tropical Medicine in collaboration with the 
Centre National de Recherche sur le Paludsime (CNRFP) in Banfora 
district, Burkina Faso. The mosquitoes used in this study originated 
from Tengrela, (10° 38’ 7.53” N; 4° 48’ 48.35” W) in 2011, 2012, 2014, 
2016 and 2018, Bakaridjan (10° 24’ 26.34” N; 4° 33’ 44.78” W) in 2013 
and 2015 and from Tiefora (10° 37’ 54.02” N; 4° 33’ 22.85” W) in 2018. All 
gDNA extractions were subject to SINE PCR® to determine the species 
of each mosquito before sequencing. 


Scan for natural selection 

We computed iHS”° and XP-EHH” statistics on phased sequences for 
regions 1.5 Mb upstream and downstream of the chemosensory locus 
for allavailable A. gambiae West African mosquito populations found 
in phase 2 of the Anopheles gambiae 1000 Genomes Project using 
scikit-allel v.1.1.10 (https://doi.org/10.5281/zenodo.822784). A peak 
was apparent in iHS in the Guinean A. gambiae population (highest 
value at 3R: 4,845,138) anda smaller peak was present in Burkina Faso, 
and a XPEHH peak was present in Cameroon. 


Clustering analysis and SNP panel 

We carried out haplotype clustering analysis around the chemosensory 
locus and identified 13 haplotypes that differed by fewer than 20 SNPs 
across a window 25 kb upstream and downstream of the limits of SAP2. 
This is a firm proxy for identity by descent” in an organism with 7~ 0.01, 
indicative of selection acting at this region. These 13 shared haplotypes 
were observed in Guinea, Burkina Faso and Cameroon. 

These 13 haplotypes were used to define 55 SNPs with high F;; 
between this cluster and the wild-type A. gambiae populations. ASNP 
panel screen was designed to encompass a range of these SNPs showing 
the highest difference between wild-type and cluster SNPs, primers for 
the two panels were as follows: forward, CAAGCAT TGCGCCATCGT; 
reverse, GAGAAGATGATACTGAGCGG. PCR was performed on gDNA 


from individual A. gambiae from Bakaridjan in 2013 (42 individuals) 
and 2015 (50 individuals) and Tiefora in 2018 (40 individuals). SNP 
genotypes from genomic sequence data from individual A. coluzzii 
samples from Tengrela from 2011 (21 individuals), 2012, 2014 and 2016 
(72 individuals for each year) were provided by the Broad Institute; 
the PCR panel was carried out on samples from the same site in 2018 
(20 samples). The panel was analysed using Phusion High-Fidelity 
DNA Polymerase (Thermo Fisher) following the manufacturer’s instruc- 
tions and the following cycles were used: 98 °C for 30 s, 35 cycles 
of 98 °C for 7s, 65 °C for 10 s, 72 °C for 15s and a final hold at 72 °C 
for 15 min. The resultant product was purified using a QiaQuick Gel 
Extraction Kit (Qiagen) following the manufacturer’s instructions and 
sent for Sanger sequencing at Eurofins Genomics using the reverse 
primer. SNPs were identified using Benchling (Biology Software; 
https://benchling.com), the resultant change in minor allele frequency 
was calculated and a trend line was fitted to each SNP using GraphPad 
Prism 8.0.2. 


Phylogeny 

cDNA sequences for each of the CSPs were retrieved from VectorBase® 
and aligned using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/ 
clustalo/). These data were loaded into Mega 7.0” and a maximum 
likelihood tree with 1,000 bootstraps was performed. 
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Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data analysed during this current study are described inthe Article, 
Extended Data Fig. 1-9, Extended Data Table 1 and the Supplementary 
Information, or are available from the corresponding authors upon rea- 
sonable request. Source Data for Figs. 1, 2 are provided with the paper. 
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Extended Data Fig. 1| Chemosensory protein cluster. a, Schematic of the 
region surrounding the shared haplotype block found in the Anopheles 
gambiae 1000 Genomes Project data with all chemosensory proteins inthe 
cluster highlighted in yellow. Genes displayed in order of appearance, from left 
toright, areas follows: AGAPO08046, AGAP013713, AGAPO08047, 
AGAP008048, AGAPO08049, AGAPO08050, AGAPO08051 (SAPI), 
AGAPO0080532 (SAP2), AGAPO08053, AGAPO08054 (SAP3), AGAPOO8055 
(CSP3), AGAPOO8056, AGAP029127 (CSP5, previously AGAPOO08058), 


AGAPO08059 (CSP1), AGAPOO8060, AGAPO08061 and AGAPO08062 (CSP4). 
b, cDNA bootstrap consensus tree inferred from1,000 replicates using the 
maximum likelihood method; the percentage of replicate trees with the 
associated clustering are shown next to the branches. Yellow indicates the 
sensory appendage proteins, orange the remaining chemosensory proteins in 
the 3R cluster and black dotted lines show CSP6, whichis located on 2R. 
Alternative isoforms are represented with ‘-RX’, with‘X’ proceeding 
alphabetically dependent on number of splice variants. 
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Extended Data Fig. 2| Overexpression of the CSP family in a multi- 
insecticide-resistant Anopheles population. Left, mean relative fold change 
of each CSP in Tiassalé mosquitoes (blue) compared with the susceptible 
control N’Gousso mosquitoes (grey) as determined by qPCR. Right, mean 
relative fold change of each CSP in Tiassalé mosquitoes (blue) compared with 
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the susceptible Kisumu population (grey). Points show three biological 
replicates. Dataare mean +s.d.*P< 0.05; **P<0.01;***P< 0.001. Statistical 
significance was calculated by an ANOVA followed by Dunnett’s post hoc test; 
Pvalues are included in Supplementary Table 2. 
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Extended Data Fig. 3| Expression levels of non-induced chemosensory to deltamethrin. Data are mean +s.d. of three biological replicates. *P< 0.05; 
proteins after exposure to deltamethrin in Tiassalé mosquitoes. “P< 0.01;***P< 0.001. Statistical significance was calculated by an ANOVA 
a, Expression levels of the remaining four CSPs at various time points after followed by Dunnett’s post hoc test. If the data were non-normal, data were 
exposure to deltamethrin in the multi-insecticide-resistant Tiassalé analysed using a Kruskall—Wallis test followed by a Dunn’s post hoc test; 


population. b, Tissue-specific induction of these four CSPs 4-h after exposure Pvalues are included in Supplementary Table 2. 
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Extended Data Fig. 4 | Efficacy of RNAi. MRNA knockdown of whole female mosquitoes 72 h after injection compared with GFP-injected controls. Dataare 
mean+s.d. of three biological replicates. 
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Extended Data Fig. 5 | Phenotype of other induced CSPs toa panel of c, Effect of attenuation of dsCSP6 on mortality after insecticide exposure in 
insecticides. a, Effect of attenuation of dsSAP3 on mortality after insecticide Tiassalé mosquitoes (right bars; deltamethrin, n=6; permethrin, n=4; 
exposure in Tiassalé mosquitoes (right bars; deltamethrin, n=4; permethrin, a-cypermethrin, n= 4; DDT, n=3; pirimiphos-methyl, n=4; bendiocarb, n=5) 
n=5;a-cypermethrin, n=3; DDT, n=3; pirimiphos-methyl, n=3; bendiocarb, compared with dsGFP-injected controls (left bars, green patterned; 
n=4) compared with dsGFP-injected controls (left bars, green patterned; deltamethrin, n=5; permethrin, n=5; a-cypermethrin, n=5; DDT, n=4; 
deltamethrin, n=5; permethrin, n=5; a-cypermethrin, n=5;DDT,n=4; pirimiphos-methyl, n=4; bendiocarb, n=8). Analysis of mortality data was 
pirimiphos-methyl, n=4; bendiocarb, n= 8).b, Effect of attenuation of dsCSP4 done using an ANOVA followed by a Tukey post hoc test; n.s indicates anon- 
on mortality after insecticide exposure in Tiassalé mosquitoes (right bars; significant change in mortality; *P< 0.05. dSCSP6 Lmortatity = 11.7-31.6%, 
deltamethrin, n=3; permethrin, n=3; a-cypermethrin, n=3; DDT, n=3; P=0.0474. N indicates the number of individual mosquitoes used for 
pirimiphos-methyl, n =3; bendiocarb, n=3) compared with dsGFP-injected phenotyping; points show the number of bioassay replicates per group. Data 
controls (left bars, green patterned; deltamethrin, n=5; permethrin, n=5; aremeants.d. 


a-cypermethrin, n=5; DDT, n=4; pirimiphos-methyl, n=4; bendiocarb, n=8). 
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Extended Data Fig. 6 | Characterization of SAP2 inthe transgenic line. 

a, MeanmRNA expression after SAP2 overexpression in the SAP2 x A10 
transgenic line (n= 2) compared with SAP2 expression in the A10 x G3 control 
(n=3). Dataare mean +s.d. and points show each biological replicate. 

b, mCherry under the PUBc A10 promoter demonstrating (i) ubiquitous 
expression; (ii) expression in the head; and (iii) expression in the legs 

as previously shown”; these results were tested across more than100 


B.iii 


independent mosquito screens. c, Intron splicing confirmed by PCRin A10 x 
SAP2 and negative control Al0 mosquitoes compared with plasmid DNA of 
pUAS:SAP2. The size of the PCR product with and without the synthetic intron 
is 647 bp and 534 bp, respectively. MW, 100-bp DNA ladder. n=2 A10 x SAP2 
samples andn=2A10 control samples (each sample consists of a pool of 5 
4-day-old, unfed females) were tested and experiments were repeated in2 
PCRs. 
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Extended Data Fig. 7 | Effect of SAP2 RNAi injection on the fitness of Tiassalé shading indicates females with eggs, light shading without; P= 0.4382). 


mosquitoes. a, Longevity of SAP2 RNAi (dsSAP2)-expressing mosquitoes (iii) Mortality after a blood meal (dark shading are females that are alive after a 
(black) compared with dsGFP-expressing control mosquitoes (green). blood meal, light those that are dead; P= 0.0052). (iv) Blood feeding 
Nindicates the number of individual mosquitoes used in each group; n.s proportions (dark shading are blood-fed females, light non-blood-fed; 
represents P= 0.113 as calculated by atwo-sided Mantel-Cox test. b, Life- P=0.3257). Numbers show total numbers of individual females in each group. 
history traits of dsSAP2-injected (black) and dsGFP-injected (green) females. Significance in (i) was calculated by atwo-tailed Mann-Whitney U-test (n.s 

(i) Number of eggs in each group 72-h after a blood meal (the median and represents P= 0.0657); significance in (ii), (iii) and (iv) was calculated using a 


interquartile range are shown). (ii) Proportion of females with eggs (dark X test. **P<0.01. 
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Extended Data Fig. 8 | Mortality of A. coluzziifield populations. Temporal 
plot of mortality from 2011 to 2018 of A. coluzzii mosquitoes to 0.05% WHO 
tube deltamethrin exposure. A is the posterior median change in mortality 
from 2011 to 2018. N indicates the number of experiments included (minimum 
sample size for any given data point is 14). p indicates the posterior probability 
that resistance (the proportion of posterior samples for which the April 2018 
mean exceeds the corresponding value inJanuary 2011) has increased over the 


2015 


2017 2019 


Year 


time period. The blue line indicates the posterior median of a logistic model fit 
to binomial test results; the two parameters of the logistic function were 
assigned using uninformative (Cauchy(0, 1)) priors. The model was fitted using 
Stan*® with 4 chains and 800 iterations per chain (400 of which were discarded 
as burn-inin each case); all parameters had Rhat <1.1, indicating convergence. 
The shading indicates the 90% predictive interval on the mean. Data and figure 
were provided by H. Toé, B. Lambert and T. Churcher. 
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Extended Data Table 1| Expression of the CSP family in western Africa 


Dataset Banfora Tiefora13 Tiefora14 Tengrela VK72011 VK72012 
AGAP008051-RA SAP1 3.3 : 2 2 
AGAP008052-RA SAP2 
AGAP008054-RA SAP3 
AGAP008055-RA CSP3 
AGAP008058-RA CSP5 
AGAP008059-RA CSP1 
AGAP008062-RA CSP4 


AGAP001303-RA 
AGAP001303-RB 


Country Burkina Faso Burkina Faso Burkina Faso Burkina Faso Burkina Faso Burkina Faso Burkina Faso 
Species An. coluzzii An. gambiae An. gambiae An. coluzzii An. coluzzii An. coluzzii An. coluzzii 
Exposure Status Unexposed Unexposed Unexposed Unexposed Deltamethrin Deltamethrin Unexposed 
Susceptible Comparator N'Gousso Kisumu Kisumu Mali Mali N'Gousso N'Gousso 


Dataset Bouake M'Be Tiassale TiassaleMali TiassaleOkyero Tiassale2011 Nkolondom 
AGAP008051-RA SAP1 726 y / 1.106 


AGAP008052-RA SAP2 10.486 5 1.360 

AGAP008054-RA SAP3 2 0.822 

AGAP008055-RA CSP3 1.091 

AGAP008058-RA CSP5 1.038 

AGAP008059-RA CSP1 1.067 0.844 

AGAP008062-RA CSP4 1.076 0.990 

AGAP001303-RA CSP6 0.848 1.130 

AGAP001303-RB CSP6 0.850 1.065 

Country Cote D'Ivoire Cote D'Ivoire Cote D'Ivoire Cote D'Ivoire Cote D'Ivoire Cote D'Ivoire Cameroon 
Species An. coluzzii An. coluzzii An. coluzzii An. coluzzii An. coluzzii An. coluzzii An. gambiae 
Exposure Status Deltamethrin Deltamethrin Deltamethrin Unexposed Unexposed Unexposed DDT 
Susceptible Comparator N'gousso N'gousso N'gousso Mali Okyero N'Gousso Kisumu 


Dataset Garre Youande KovieOkyero KovieMalanville 
AGAP008051-RA 


AGAP008052-RA SAP2 
AGAP008054-RA SAP3 
AGAP008055-RA CSP3 
AGAP008058-RA CSP5 
AGAP008059-RA CSP1 
AGAP008062-RA CSP4 
AGAP001303-RA CSP6 


AGAP001303-RB 


Country Cameroon Cameroon Cameroon Equatorial Guinea Togo Togo 
Species An. coluzzii An. coluzzii An. gambiae An. coluzzii An. coluzzii An. coluzzii 
Exposure Status DDT DDT Bendiocarb Deltamethrin Unexposed Unexposed 
Susceptible Comparator N'Gousso N'Gousso N'Gousso N'gousso Okyero Malanville 


Microarray fold-change data from microarray datasets were retrieved from IR-TEx" across western Africa for A. coluzzii and A. gambiae populations for each member of the CSP family. Fold 
changes showing significant overexpression in the respective datasets are highlighted in green, yellow indicates underexpression and white backgrounds show no significant change. SAP2 is 
shown in bold. The country of origin, species and exposure status of the resistant population are given below each table, as well as the susceptible population used as for comparison. 
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The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 
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A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
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For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
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Sample size In all cases experiments were carried out in at least biological triplicate and in the case of qPCR and the binding assays technical triplicate, as 
previously published in the MIQE guidelines. In each case a minimum number of mosquitoes were included in biological replicate in qPCR 
analysis (5-10) to ensure enough RNA was produced. For each bioassay over 20 individuals per WHO tube for diagnostic assay, with n greater 
than or equal to 3 following standard procedure specified by WHO's Testing Procedures document stating around 120-150 mosquitoes 
minimum should be used in these assays (combined exposed+control). In the case of field sample sequencing, we were constrained by the 
number of individuals available of the correct species in historical samples. 


Data exclusions No data was excluded from the analysis 


Replication As described above all experiments were performed with at least three biological replicates and were required, at least three technical 
replicated. All experiments were successful; however only 2 samples were available for qPCR of the SAP2 over-expressing line. 


Randomization _ This is not relevant as mosquitoes are reared in large cases, often with over 200 females so selection of females is inherently random. 


Blinding Blinding is not possible in this study design as it is necessary to know which tubes contain insecticides and the relevant meta-data for the 
sequencing experiments. 
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Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
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Palaeontology |_| MRI-based neuroimaging 
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Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Anopheles gambiae sl female mosquitoes, 3-5 days old 

Wild animals No wild animals were involved in this study. 

Field-collected samples No mosquitoes were directly collected from field in this study. 
Ethics oversight No ethical approval is needed for use of mosquitoes 
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Homo erectus is the founding early hominin species of Island Southeast Asia, and 
reached Java (Indonesia) more than 1.5 million years ago’. Twelve H. erectus calvaria 
(skull caps) and two tibiae (lower leg bones) were discovered from a bone bed located 
about 20 m above the Solo River at Ngandong (Central Java) between 1931 and 1933**, 
and are of the youngest, most-advanced form of H. erectus’ *. Despite the importance 
of the Ngandong fossils, the relationship between the fossils, terrace fill and ages have 
been heavily debated? “. Here, to resolve the age of the Ngandong evidence, we use 
Bayesian modelling of 52 radiometric age estimates to establish—to our knowledge— 
the first robust chronology at regional, valley and local scales. We used uranium-series 
dating of speleothems to constrain regional landscape evolution; luminescence, 
“Oargon/*argon (*°Ar/’Ar) and uranium-series dating to constrain the sequence of 
terrace evolution; and applied uranium-series and uranium series—electron-spin 
resonance (US-ESR) dating to non-human fossils to directly date our re-excavation of 
Ngandong*”’. We show that at least by 500 thousand years ago (ka) the Solo River was 
diverted into the Kendeng Hills, and that it formed the Solo terrace sequence between 
316 and 31 ka and the Ngandong terrace between about 140 and 92 ka. Non-human 
fossils recovered during the re-excavation of Ngandong date to between 109 and 

106 ka (uranium-series minimum)” and 134 and 118 ka (US-ESR), with modelled ages 
of 117 to 108 thousand years (kyr) for the H. erectus bone bed, which accumulated 
during flood conditions*”. These results negate the extreme ages that have been 
proposed for the site and solidify Ngandong as the last known occurrence of this long- 
lived species. 


Our current understanding of H. erectus in Asia largely derives from 
evidence from the Solo River region of central Java in the Indonesian 
archipelago’. However, this region presents great challenges to con- 
structing solid chronologies for hominin occupation, evolution and 
dispersal’. These problems relate to finding appropriate materials 
for dating, confusion over the location of previous excavations and 
find spots, alack of direct association between the fossils and material 
being dated, taphonomic differences within the faunal assemblages, 
reworking of surrounding fluvial deposits and fossils and the leaching of 


uranium from the fossils being dated. Nowhere are these complications 
and misperceptions more apparent than at the site of Ngandong”>”°, 

In1996, late-Pleistocene age estimates from uranium-series (U-series) 
and ESR dating were reported for mammalian bone within the 20-m 
terrace fillat Ngandong and other nearby localities’. These unexpect- 
edly young results (corresponding to ages of 53 to 27 kyr) triggered 
debate regarding the taphonomy of the Ngandong fossil assemblage 
and the sedimentological context of the dated material”. More dating 
followed: direct gamma-spectrometric’° Th/U dating of H. erectus 
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Fig. 1| Location of Java with H. erectus sites and key study sites onthe 
terraces. a, Java and study sites, and the location of Javain the Indonesian 
archipelago (inset). b, The location of the key study areas; the Southern 
Mountains, Kendeng Hills (Kendeng anticlinorium), Solo River terraces and the 
H. erectus sites of Sangiran, Trinil, Sambungmacan and Ngandong, as wellas the 
other Javanese sites (Jigar and Punung). The scale bar for the elevation changes 


cranial bone from Ngandong and Sambungmacan, giving ages of 40 
and 60-70 kyr, respectively”; U-series and ESR dating of mammalian 
bone from Jigar | (downstream of Ngandong), which gave an age of 
143 +20 or -17 kyr; and *°Ar/*’Ar dating of pumice hornblende from 
Ngandong Il and Jigar, giving an age of 546 +12 kyr’ (Fig. 1; uncertain- 
ties are defined in figure legends). 

To address these chronological inconsistencies, we applied a regional 
approach to establishing a chronology for the re-excavations that we 
conducted at Ngandong (Fig. 2). First, to establish alandscape context, 
we used U-series dating of speleothems to constrain the tectonic uplift 
of the Southern Mountains that caused the Solo River to divert north- 
ward to flow through the Kendeng Hills and form the Solo River strath 
terrace sequence (Fig. 3, Extended Data Fig. 1). Second, to establish a 
terrace context, we applied red thermoluminescence, post-infrared 
infrared-stimulated luminescence (pIR-IRSL) and *°Ar/*’Ar dating tech- 
niques to terrace sediments at localities covering the entire incision 
sequence: Kerek (upper), Padasmalang (middle), Ngandong (lower), 
Sembungan terrace (lower, contemporaneous with Ngandong), Ngle- 
back (lowermost) and Menden terrace (outside of the Kendeng Hills and 
the Solo terrace sequence). We used the terrace chronology to identify 
the incision phases and constrain the age of the Ngandong terrace by its 
relative position in the terrace sequence (Extended Data Fig. 2). Third, 
we established a fossil context by re-excavating the Ngandong bone bed 
(Extended Data Fig. 6, Supplementary Information section 2) to better 
understand the sedimentology of the terrace fill, the taphonomy of the 
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relates to bothaand b. Topographical map redrawn from the US Geological 
Survey’s Earth Resources Observation and Science Center (USGS EROS).c, A 
section of the Solo River system, from Kerek village in the south to Menden 
village in the north, showing our key study sites (including Ngandong, 
Sembungan, Padasmalang, Nglebak, Kerek and Menden). USGS Landsat image 
with digital elevation model”. 


Ngandong fossil assemblage and to provide context-supported datable 
material. We applied laser-ablation U-series analyses” to 8 bovid teeth 
and 15 mammalian long bones (Supplementary Information section 7) 
and coupled US-ESR analyses to 3 additional fossil bovid teeth (Sup- 
plementary Information section 6). 

According to the U-series-derived speleothem chronology (Sup- 
plementary Table 13), the Gunung Sewu section of the Southern 
Mountains had been uplifted and the Solo River had been diverted by 
>500 ka, which places a maximum age on the formation of the terrace 
sequence. Consistent with this, the red thermoluminescence, pIR- 
IRSL and *°Ar/**Ar chronologies indicate that the sequence of strath 
terraces was formed between 316 + 28 and 31+ 6 ka (with a maximum 
age of 358 + 26 kyr), and are all chronologically consistent with their 
elevation (Extended Data Figs. 2, 8, Supplementary Figs. 1, 2, Supple- 
mentary Tables 6, 7, 12). 

Our Ngandong excavations yielded a composite cross-section with 
five lithofacies comprising the Ngandong terrace fill: these lithofacies 
could be related to terrace-fill descriptions from excavations at the site 
in the 1930s, which yielded the H. erectus calvaria and tibiae*” (Fig. 2, 
Extended Data Fig. 5, Supplementary Table 1, Supplementary Informa- 
tion section 2). All H. erectus fossils recovered in the original excavations 
were associated with facies C (gravelly sand bars). Our excavations 
yielded 867 in situ disarticulated non-hominin fossils—mostly isolated 
teeth and bone fragments, but largely complete elements were also 
observed?” (Extended Data Fig. 6c, f, g). Usually, these fossils were 
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Fig.2|Cross-sections of Ngandong site, showing the stratigraphic context labelled on the plan view map, following a previous publication*. I-XI represent 
and the location of our dating samples. Depositional facies A-E, exposed in the calvaria, and A represents one of the tibiae. Facies A is commonly one or two 


excavation bulkheads with collection points for pIR-IRSL samples (IND codes), marl-cobbles thick, but also forms thicker bars. Facies B is moderately well- 
US-ESR specimens and U-series diffusion-absorption-decay specimens (four- sorted, fine- to medium-grained sand with shallow-trough and ripple cross- 


digit specimen numbers) (Supplementary Tables 6, 9, 10). Dated fossils not laminations. Facies C is very coarse-grained, very poorly sorted, crudely cross- 
shown in the figure are 1026 and 1110 from excavation pit C (C onthe plan-view bedded and partially carbonate-cemented sand and pebbly granule 

map); 1075, 1076 and 1095 from pit L (L); and 1088 from pit K (K). Distances conglomerate. Facies D is moderately well-sorted, medium- to coarse-grained, 
between pits east of the reserve are indicated on the grey dashed columns. The troughcross-bedded sand. Facies Eis sandy gravelly muddy diamicton, which 
facies are present on the west and east sides of the archaeological reserve, but fills channels cut into facies B—D (and into which the bedding inclines). 

in different relative abundances. The Ngandong bone bed in facies C was Extended Data Figure 6 and Supplementary Information section 3 provide 
generally excavated away in 1931-1933, but the underlying bone bed in facies A additional lithological information. Scale bars apply to vertical and horizontal 
appears to be widely present at the site. The Ngandong H. erectus finds are distances. Cl, contour interval. 


Nature | Vol577 | 16 January 2020 | 383 


Article 


Speleothem age 


Landscape 
context 


~ Gunung Sewu 


Solo River valley Southern Mountains 


Model 


4 Red TL <—Maximum age 
-O- US-ESR —» Minimum age 


Terrace age 


Kerek Kendeng Hills 


& U-series “ 
~} pIR-IRSL pa 
@ *°Ar/9Ar 


Terrace 
context 


358 + 26 
316 + 36 
Padasmalang 


212+70 
161 + 54 


Fossil 
context 


Sembungan 


Ngandong 


133 + 42 
137 + 16 
169 + 14 


200 


300 
Age (kyr) 


Fig. 3|Aregional chronology for the Ngandong evidence, summarizing the 
results of our approach. Composite chronology for the landscape, terrace and 
fossil context for the Ngandong site. Allage ranges and errors are presentedin 
kyr. Landscape context: constraining the evolution of the landscape from 
U-series dating of speleothems (n=5) from the Gunung Sewu region of the 
Southern Mountains at an altitude of between 340-600 m (mean ages, withs.d. 
at 2ouncertainties). Terrace context: from the red thermoluminescence and 
pIR-IRSL dating of quartz and feldspars, and *°Ar/**Ar dating of hornblende 
from the terrace deposits (mean ages withs.d. at louncertainties). Five 
terraces were dated in the region, from the oldest (Kerek; upper) down through 
Padasmalang (middle), Ngandong (lower), and Sembungan (lower) to Nglebak 
(lowermost), the youngest (n= 21). Relative terrace elevation in metres above 
mean sea level is shown, but is not to scale. Fossil context: this was provided by 
the excavations at Ngandong, in which four dating techniques were applied to 


encased in a 10-25-cm-thick lens of poorly sorted fossil-rich muddy- 
pebble conglomerate that was deposited in a sediment-charged river 
(Fig. 2). Most of the 232 specimens that we analysed were broken before 
deposition, but more than 90% of them exhibit only minor abrasional 
rounding (Extended Data Fig. 7). Surface cracking and exfoliation indic- 
ative of substantial surface exposure is rare. No fossils show evidence 
of being reworked from a considerably older formation. Fossils from 
facies A and facies C havea similar taphonomy but fossils from facies C 
are larger, and more complete. No stone tools were discovered in this 
excavation. However, a concentration of 89 artefacts (dated to about 
130 ka) was discovered ina contemporaneous Sembungan excavation 
located upstream of Ngandong (Extended Data Fig. 3, Supplementary 
Information section 3), and—when combined with the Flephas speci- 
men discovered in the older Menden terrace, located downstream of 
Ngandong (Extended Data Fig. 4, Supplementary Table 5, Supplemen- 
tary Information section 4)—this evidence provides an occupational 
context for the Ngandong hominins. 

Laser-ablation U-series analysis yielded minimum age estimates of 
about 120 to 80 kyr and about 140 to 60 kyr for the mammalian long 
bones and eight bovid teeth, respectively (Extended Data Fig. 9, Sup- 
plementary Tables 10, 11). US-ESR dating of three additional bovid 
teeth provided age estimates corresponding to 134-118 ka (Extended 
Data Fig. 10, Supplementary Tables 8, 9). These age estimates for the 
Ngandong fauna agree with the age of deposition of the Ngandong ter- 
race (144 + 37 to 111+ 9 kyr), the age of the upper and middle terraces 
(316 to 161 kyr), the maximum age for the lower terrace at Sembungan 
(169 + 14 kyr) and the age of the lowermost terrace (31+ 6 kyr), according 


384 | Nature | Vol577 | 16 January 2020 


140 + 44 
144 + 37 
Nglebak _———— k 
~ Solo River 


the fossils and sediments (n = 26, mean ages withs.d. at 2o uncertainties). The 
composite stratigraphy of the Ngandong terrace is shown. Facies A,B, CandD 
referring to the same facies shown in Fig. 2, witha thickness of about 2.5 m. All 
three contexts have been modelled to provide a Bayesian age for the 
boundaries between the facies at Ngandong (depicted as black rectangles with 
dashed lines in the model) with the resulting mean age ranges presented at lo 
uncertainties: 140 +24 to123+9 kyr (facies A), 123 +9 to117+ 6kyr (facies B), 
117+6to108+9 kyr (facies C)and108+ 9 to92+16kyr (facies D). Alarger 
version of this model can be found in Supplementary Information section 11. 
Symbols correspond to dating techniques: red thermoluminescence, red 
squares; US-ESR, yellow circles; U-series, blue triangles; pIR-IRSL, green 
diamonds; *°Ar/**Ar, purple circle. Red and blue arrows denote maximum and 
minimum ages, respectively. 


to the red thermoluminescence, pIR-IRSL and *°Ar/**Ar dating. When 
these 52 dating results are entered into a Bayesian model (Fig. 3, Sup- 
plementary Tables 14, 15), an age range of 140-92 kyr is established for 
the deposition of the entire Ngandong terrace sequence, and a range 
of 117-108 kyr for the bone bed in facies C (Fig. 3), which contained the 
H. erectus material discovered in the 1930s. 

Our modelled age range for the Ngandong terrace is substantially 
older than the direct “°Th/*U determinations (40 to 60-70 kyr) that 
have been reported for the Ngandong H. erectus calvaria”, and is similar 
to previously reported combined U-series and ESR ages (correspond- 
ing to 77 to 143 kyr) for faunal teeth in the terrace at Jigar. However, 
the upper age limit for the Ngandong site (of about 500 kyr) that was 
previously suggested on the basis of *°Ar/*Ar analyses of water-lain 
pumice from the Ngandong formation“, and the age for Ngandong 
H. erectus calvaria of >200 kyr based on gamma-spectrometric U-series 
techniques”, are at odds with our proposed age range for the bone bed 
(117-108 kyr) and our age for the next-oldest terrace in the sequence 
(162 +33 kyr). We believe that our bone-bed chronology is much closer 
to the actual age of Ngandong H. erectus. 

Sedimentary and taphonomic observations at Ngandong are best 
explained by a single flood event, during which facies B, C and D 
accumulated in rapid succession” (Supplementary Information sec- 
tions 2,10). A reliable age estimate for the formation of the facies and 
the non-hominin fossils at Ngandong establishes a depositional age 
for the H. erectus remains (Fig. 2, Supplementary Information sec- 
tions 5-9). The fossils show only incipient weathering and transport 
damage, despite being deposited ina river (Extended Data Fig. 7). The 


preservation of delicate bony structures*” and soft tissues (Supplemen- 
tary Table 3) indicate limited exposure. The demise of H. erectus and 
other vertebrates evidently occurred upriver of Ngandong—possibly 
caused by the changing environmental conditions”. The skeletonized 
and disarticulated remains were then entrained by lahar flows” and 
monsoonal flooding of the Solo River (Supplementary Information 
section 10). The remains accumulated within large in-channel debris 
jams”, owing to the narrowing of the valley at Ngandong, and triggered 
deposition in channel bars (facies A) and gravelly sand bars (facies C), 
shortly before sandy bedforms (facies D) and channelized mudflows 
(facies E) buried the bone beds. 

This dated sedimentary and taphonomic framework for the Ngan- 
dong bone bed does not support an overlap between modern humans 
and H. erectus in this region®’®”°. Instead, the Ngandong fauna in 
facies A (140 + 24 ka) pre-dates the rainforest-associated site of Punung 
(between 128 + 15 and 118 + 3 ka)”, which agrees with the proposed 
biostratigraphical sequence of Java based on the palaeoenvironmental 
and associated faunal changes”’. The H. erectus bone bed (facies C) 
overlaps with Punung and falls within the sea level lowstand at the onset 
of termination II (around 120 ka)**. Thus, it represents the last, dying 
remnants of the archaic fauna and open woodland environments that 
were superseded by the impeding rainforest flora and fauna associated 
with Punung (Supplementary Information section 10). 

Furthermore, we can place Ngandong into a regional framework for 
Island Southeast Asia. H. erectus continuously inhabited the island, with 
dates on Java that start at 1.51 to 0.93 million years ago at Sangiran’”, 
then540 to 430 kaat Trinil® and ending with 117 to 108 ka at Ngandong. 
H. erectus was dispersed widely by 700 ka, as shown by archaeological 
evidence for hominins at Mata Menge (Flores, Indonesia)”° and Cagayan 
Valley (Luzon, Philippines)”. Two insular dwarf hominins are found on 
these outlying islands: Homo floresiensis at 100 to 60 ka** and Homo 
luzonensis at 66.7 +1ka”’. Phylogenetic relationships have yet to be deter- 
mined for these two hominins, but they show morphological similarities 
with H. erectus”. Sharing similar temporal ranges, Ngandong H. erectus, 
H.floresiensis and H. luzonensis represent three evolutionary trajectories 
of Homo in Island Southeast Asia, each of which ended in extinction. 

Genomic evidence from modern populations in New Guinea provides 
estimates for the dates of the arrival of another early hominin in Island 
Southeast Asia. Two Denisovan lineages diverged from the Altai Den- 
isovans, one at about 363 kaand the other at about 283 ka®’. These deep 
divergence dates provide evidence for the early arrival of Denisovans 
in Island Southeast Asia. Dispersing Homo sapiens encountered Den- 
isovan lineages in Island Southeast Asia at about 45.7 ka®° and at about 
29.8 ka®°. Additionally, a residual signal of approximately 1% archaic 
DNA in modern regional populations lies outside the human-Neander- 
thal-Denisovan clade®’. This may reflect a past introgression event with 
H. erectus and provide evidence that these Denisovans encountered a 
late-surviving H. erectus population. 

Anincreasingly complex picture of hominin evolution in Pleistocene 
Island Southeast Asia is emerging from fossil and genomic evidence. 
The chronology of Ngandong H. erectus is critical for this narrative. We 
have approached the age of the Ngandong site in three increasingly 
precise contexts: the Kendeng Hills landscape, the Solo River terraces 
and the Ngandong bone bed. Our age estimates for the vertebrate fos- 
sils—including H. erectus—at Ngandongare, therefore, firmly anchored 
within their regional chronological and geomorphical contexts. With 
modelled ages of 117 to 108 kyr, the Ngandong bone bed can finally 
assumeits correct position in the hominin biostratigraphical sequence 
of Island Southeast Asia. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 


acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1863-2. 


1. Larick, R. et al. Early Pleistocene “°Ar/*°Ar ages for Bapang Formation hominins, Central 

Jawa, Indonesia. Proc. Natl Acad. Sci. USA 98, 4866-4871 (2001). 

2. Zaim, Y. et al. New 1.5 million-year-old Homo erectus maxilla from Sangiran (Central Java, 

ndonesia). J. Hum. Evol. 61, 363-376 (2011). 

3. Huffman, O. F., de Vos, J., Berkhout, A. W. & Aziz, F. Provenience reassessment of the 

1931-1933 Ngandong Homo erectus (Java), confirmation of the bonebed origin reported 

by the discoverers. Paleoanthropology 2010, 1-60 (2010). 

4. Oppenoorth, W.F.F. Een nieuwe fossiele mensch van Java. Tijdschrift van het Koninklijk 

Nederlandsch Aardrijkskundig Genootschap XLIX, 704-708 (1932). 

5. Frankel, M. Notes from an excavation. Nature https://doi.org/10.1038/news.2010.377 

(2010). 

6. | Santa Luca, A. P. The Ngandong Fossil Hominids: A Comparative Study of a Far Eastern 

Homo erectus Group Yale University Publications in Anthropology 78 (Yale Peabody 

Museum, 1980). 

7. _ Rightmire, G. P. The Evolution of Homo erectus: Comparative Anatomical Studies of an 
Extinct Human Species (Cambridge Univ. Press, 1990). 

8. Schwartz, J. H. & Tattersall, |. The Human Fossil Record: Volume Four - Craniodental 
Morphology of Early Hominids (Genera Australopithecus, Paranthropus, Orrorin) and 
Overview (Wiley, 2005). 

9. Swisher, C. C. Ill et al. Latest Homo erectus of Java: potential contemporaneity with Homo 
sapiens in southeast Asia. Science 274, 1870-1874 (1996). 

10. Bartstra, G.-J. et al. Ngandong man: age and artifacts. J. Hum. Evol. 17, 325-337 (1988). 

11. Rizal, Y. Die Terrasse Entlang des Solo-Flusses in Mittel und Ost-Java, Indonesien. PhD 
thesis, Universitat zu K6Ln (1998). 

12. Gruin, R. & Thorne, A. Dating the Ngandong humans. Science 276, 1575-1576 (1997). 

13. Yokoyama, Y., Falguéres, C., Sémah, F., Jacob, T. & Grtin, R. Gamma-ray spectrometric 
dating of late Homo erectus skulls from Ngandong and Sambungmacan, Central Java, 
Indonesia. J. Hum. Evol. 55, 274-277 (2008). 

14. Indriati, E. et al. The age of the 20 meter Solo River terrace, Java, Indonesia and the 
survival of Homo erectus in Asia. PLoS ONE 6, €21562 (2011). 

15. Ciochon, R. L. et al. Rediscovery of the Homo erectus bed at Ngandong: site formation of 
a late Pleistocene hominin site in Asia. Am. J. Phys. Anthropol. 48, 106 (2009). 

16. Sambridge, M., Grin, R. & Eggins, S. U-series dating of bone in an open system: the 
diffusion-absorption-decay model. Quat. Geochronol. 9, 42-53 (2012). 

17. Huffman, O. F. et al. Mass death and lahars in the tapbhonomy of the Ngandong Homo 
erectus bonebed, and volcanism in the hominin record of eastern Java. 
Paleoanthropology 2010, abstr. A14 (2010). 

18. Westaway, M. C. Preliminary observations on the taphonomic processes at Ngandong 
and some implications for a late Homo erectus survivor model. Tempus 7, 189-193 (2002). 

19. Westaway, M., Jacob, T., Aziz, F., Otsuka, H. & Baba, H. Faunal taphonomy and 
biostratigraphy at Ngandong, Java, Indonesia and its implications for the late survival of 
Homo erectus. Am. J. Phys. Anthropol. 120, abstr. 222-223 (2003). 

20. Westaway, M. C. & Groves, C. P. The mark of ancient Java is on none of them. Archaeol. 
Ocean. 44, 84-95 (2009). 

21. Jenkins, K. et al. New excavations of a Late Pleistocene bonebed and associated MSA 
artifacts from Rusinga Island, Kenya. Paleoanthropology 2012, abstr. A17 (2012). 

22. Westaway, K. E. et al. Age and biostratigraphic significance of the Punung rainforest 
fauna, East Java, Indonesia, and implications for Pongo and Homo. J. Hum. Evol. 53, 
709-717 (2007). 

23. de Vos, J. The Pongo faunas from Java and Sumatra and their significance for 
biostratigraphical and paleo-ecological interpretations. Proc. Koninklijke Nederlandse 
Akademie van Wetenschappen, Series B 86, 417-425 (1983). 

24. Railsback, L. B., Gibbard, P. L., Head, M. J., Voarintsoa, N. R. G. & Toucanne, S. An 
optimized scheme of lettered marine isotope substages for the last 1.0 million years, and 
the climatological nature of isotope stages and substages. Quat. Sci. Rev. 111, 94-106 
(2015). 

25. Joordens, J.C. et al. Homo erectus at Trinil on Java used shells for tool production and 
engraving. Nature 518, 228-231 (2015). 

26. Brumm, A. et al. Age and context of the oldest known hominin fossils from Flores. Nature 
534, 249-253 (2016). 

27. \Ingicco, T. et al. Earliest known hominin activity in the Philippines by 709 thousand years 
ago. Nature 557, 233-237 (2018). 

28. Sutikna, T. et al. Revised stratigraphy and chronology for Homo floresiensis at Liang Bua in 
Indonesia. Nature 532, 366-369 (2016). 

29. Detroit, F. et al. Anew species of Homo from the Late Pleistocene of the Philippines. 
Nature 568, 181-186 (2019). 

30. Jacobs, G. S. et al. Multiple deeply divergent Denisovan ancestries in Papuans. Cell 177, 
1010-1021 (2019). 

31. Suminto, M. M., Sidarto, Maryanto, S., Susanto, E. E., Aziz, S. F., Christiana, |., & Fitriani, E. 
A Study of the Solo River Terraces from Kerek to Karsono: Ngawi and Bojonegoro Regions, 
East Java (Geological Research and Development Centre, 2004). 

32. Sidarto & Morwood, M. J. Solo River terrace mapping in the Kendeng Hills area, Java: use 
of landsat imagery and digital elevation model overlays. J. Sumber Daya Geologi 14, 
196-207 (2004). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


Nature | Vol577 | 16 January 2020 | 385 


Article 


Methods 


To constrain age range of the Ngandong fossils, we considered three 
interrelated components: a landscape context, a terrace context and 
a fossil context. 


Establishing the landscape context 

The landscape evolution of central Java followed this sequence of 
events: (1) the seismic uplift of the Southern Mountains, (2) the north- 
ward diversion of the Solo River through the Kendeng Hills and (3) 
the resulting terrace formation, including that of the Ngandong ter- 
race***, The timing of these events can be constrained by establishing 
the age of the oldest speleothem deposits in the Gunung Sewu (Fig. 1b) 
using U-series dating, which provides a minimum age for the uplift 
of the Southern Mountains, the diversion of the Solo River and the 
subsequent uplift of the Kendeng anticlinorium (which created the 
Kendeng Hills)—and thus a maximum age for terrace formation in 
the Solo River valley. 


U-series dating of speleothems. U-series dating of flowstones and 
stalagmites provides minimum ages for the karstification of the Gunung 
Sewu, which was initiated by the uplift of the Southern Mountains. 
Calcite was sampled from the outer edges of the stalagmites using a 
hammer and chisel. Calcite crystals that were free of any weathered 
surfaces were extracted from each of these samples, and cleaned ultra- 
sonically to remove as much sediment contamination as possible before 
they were subjected to chemical treatment and isotopic measurements 
by mass spectrometry®. U-series dating of the speleothem samples 
was conducted in the Radiogenic Isotope Facility (The University of 
Queensland), using a VG Sector 54 thermal ionization mass spectrom- 
eter (TIMS) anda Nu Plasma multi-collector inductively coupled mass 
spectrometer (MC-ICP-MS). Analytical procedures followed previous 
publications for TIMS and MC-ICP-MS*®>”, °Th/?4U ages were calcu- 
lated using Isoplot EX 3.75 (ref. °°), and half-lives of 75,690 years (?°Th) 
and 245,250 years (734U)”. 


Establishing the terrace context 

As strath river terraces form in a sequence (with the oldest at the high- 
est elevation and the youngest at the lowest), the age of the oldest Solo 
River terraces at Kerek (upper) and the youngest in Nglebak (lowermost) 
provide amaximum and minimum age range for the Ngandong terrace. 


Strategy for identifying and dating the terraces. Initial mapping work 
inthis region involved the use of digital elevation maps of the terraces 
created from a Landsat image (ETM7+ using bands 4, 5 and 7 merged 
with band 8) overlying a 1:25,000 topographical map, to produce a 
15-m resolution digital elevation model*!. Suitable sites were then 
chosen from each of the four identified terraces (lowermost, lower, 
middle and upper) within a designated study section from Kerek vil- 
lage in the south to Sunggun village in the north (Fig. 1c, Extended Data 
Fig. 1f). Compared to Sangiran and Mojokerto, the site of Ngandong 
is within much-younger clastic terrace deposits, which presents an 
opportunity to constrain the age of the deposition of the terrace using 
luminescence dating. This technique has yet to be successfully applied 
to this river system. The nature of the quartz and feldspar grains in 
this volcanic province is challenging, with no usable blue signal and 
high anomalous fading, respectively. This necessitates the use of red 
thermoluminescence dating techniques on quartz grains** and pIR- 
IRSL on feldspars™. This is supported by *°Ar/*’Ar techniques on in situ 
pumice lens in the alluvium at Sembungan (low terrace). 


Luminescence dating of quartz and feldspar grains from the fluvial 
terraces. At each site, a suitable sampling location was chosen, a sec- 
tion was dug to expose the terrace sediments and the stratigraphic 
characteristics were recorded. Sampling for luminescence dating was 


conducted using either opaque PVC pipes banged into the terrace (labo- 
ratory codes IND1-18 and NGD1-3) or—for the more-cemented terrace 
sediments—bulk sampling was conducted at night using subdued red- 
light conditions (laboratory code KER1). Quartz and feldspars grains 
of 90-125 pm were separated using standard purification procedures, 
including a final etch in 40% hydrofluoric acid for 45 min and 10% for 
10 min, respectively, to remove the external alpha-dosed rinds*’. The 
Solo River terraces yielded small amounts of quartz: 60 mg, with about 
20 mg used for aliquots A and B of the dual-aliquot protocol (DAP) 
procedure“ to derive the D, estimates (Supplementary Table 6), and 
only about 40 mg remaining for additional testing. Therefore, feldspars 
were also analysed to support the sedimentary chronology, which also 
yielded small—but usable—amounts. All luminescence analysis was 
conducted at the ‘Traps’ luminescence dating facility at Macquarie 
University. 

Using a DAP“*, isothermal red thermoluminescence emissions from 
quartz were detected using a red-sensitive photomultiplier tube (Elec- 
tron Tubes S20 9658B). Quartz grains were mounted onstainless-steel 
discs using silicone oil spray; each large aliquot was composed of about 
5,000 grains (around 10 mg). The isothermal red thermoluminescence 
emissions** *’ were measured using a red-sensitive photomultiplier 
tube (Electron Tubes 9658B) and cooling tower (LCTSO liquid-cooled 
thermoelectric housing) with Kopp 2-63 and BG-39 filter combination®. 
Laboratory irradiations were conducted using a calibrated °°Sr/*°Y 
beta source. D, values were estimated from the 20-30-s interval of 
isothermal decay (which was bleachable by >380-nm illumination) 
using the final 160 s as background. Aliquots were heated to 260 °C at 
aheating rate of 5K stand then held at 260 °C for 1,000 sto minimize 
the unwanted thermoluminescence from incandescence. 

To overcome the problems of anomalous fading*’, we adopted a 
standard pIR-IRSL protocol for single aliquots of feldspars using a 
300 °C preheat and 270 °C pIR-IRSL stimulation combination, following 
astandard 50-°C infrared stimulation. The use of single-grain feldspar 
techniques was investigated but low sample sensitivities yielded very 
low acceptance rates (<0.5%) that were not practical considering the 
small amounts of sample yield. pIR-IRSL measurements were thus con- 
ducted on single aliquots of feldspars using infrared (875-nm) light- 
emitting diodes at 80% power for 200 s (to enable a long stimulation), 
and the emissions were detected using Schott BG-39 and Corning 7-59 
filters to transmit wavelengths of 320-480 nm (ref. ”). Four procedural 
tests were applied to small aliquots of about 1,000 grains using the 
following preheat and infrared stimulation combinations: (1) 250 and 
225 °C (refs. 48°), (2) 280 and 250 °C (ref. ”), (3) 300 and 270 °C (ref. *) 
and (4) 320 and 290 °C (refs. **?). The tests were: (1) a preheat plateau 
test using 3 discs; (2) fading tests, including a prompt, 1-h, 10-h and 
1-week delay; (3) bleaching tests using 1 fresh aliquot per temperature 
to determine the amount of residual IRSL after an extended bleach of 
4hinasolar simulator; and (4) dose recovery tests using 8 bleached 
aliquots (bleached using a solar simulator for 4h) anda surrogate dose 
of 200 Gy. From these tests, it was determined that the 270-°C stimula- 
tion and 300-°C preheat combination plotted within the flattest part of 
the preheat plateau, provided the best recovery of the surrogate dose, 
with the least fading of all of the pIR-IRSL signals (g value = between 
1.6-2.2% per decade) and lowest residual value (<10 Gy). In total, 24 
aliquots were used to conduct a modified single-aliquot regenerative- 
dose (SAR) procedure. These resulting age estimates were corrected 
according to the results of the anomalous fading tests (using a weighted 
mean fading rate of 1.9 + 0.3% per decade), but no residual corrections 
were undertaken. 

Measurements of °U, **U, 7”Th (and their decay products) and “°K 
were estimated using Geiger-Muller multi-counter beta-counting of 
dried and powdered sediment samples in the laboratory, anda portable 
gamma spectrometer in the field. The corresponding (dry) beta and 
gamma dose rates were obtained using previously published conver- 
sion factors™ and beta-dose attenuation factors”. An effective internal 


alpha dose rate of 0.03 Gy kyr (ref. °°) and 0.72 Gy kyr‘ (refs. *”*°) were 
used for the 90-125-jm quartz and feldspar samples, respectively 
(owing to the radioactive decay of *°K and *’Rb), which were made 
assuming K (12.5 + 0.5%)” and ®’Rb (400 +100 1g g")* concentrations, 
and was included in the total dose rate. Cosmic-ray dose rates were 
estimated from published relationships’, making allowance for the 
sediment overburden at the sample locality (about 0.5-4.0 m), the 
altitude (around 60 m above sea level) and geomagnetic latitude and 
longitude (7° and 111°) of the sampling site. The total dose rate was 
calculated using a long-term water content of 5-15 + 2%, whichis close 
tothe measured (field) water content of 5-15%. High-resolution gamma 
spectrometry of the powdered sediment samples was also conducted 
to test for disequilibrium within the uranium decay chain (Supple- 
mentary Table 7). 


4° Ar/**Ar dating of a pumice lens in the Sembungan terrace. *°Ar/?’Ar 
dating of single and multiple hornblende crystals was conducted ona 
pumice lens (sample SS59B) taken from the middle of the Sembungan 
terrace, to provide the eruption age of the pumice and a maximum age 
of deposition for the terrace. The 10-cm-thick pumice lens was exposed 
inthe western quarry wall over alength of 2.2 mata depth of 2.0 m below 
datum, and 2.1m above the base of the terrace fill. Euhedral hornblende 
crystals up to 1 mm in length were hand-picked under a binocular mi- 
croscope and loaded into wells in aluminium sample discs (diameter 18 
mm) for neutron irradiation, along with the astronomically calibrated 
1.185-million-year-old Alder Creek sanidine® as the neutron fluence 
monitor. Neutron irradiation was performed for 0.25 hinthe cadmium- 
shielded CLICIT facility at the Oregon State University TRIGA reactor. 

Argon isotopic analyses of the gas released by laser fusion of horn- 
blende crystals (Supplementary Table 12) was done on a fully auto- 
mated, Nu Instruments Noblesse multi-collector noble-gas mass 
spectrometer, using previously documented instrumentation and 
procedures”, 


Estimating downcutting rates. We used the terrace chronology to 
establish downcutting rates for the Solo Valley in the Kendeng Hills. 
These rates were established by dividing the age of each alluvial ter- 
race (in thousands of years) (Supplementary Table 6), by the distance 
of downcutting between each terrace (mm) to provide a downcutting 
rate (in millimetres per thousand years (mm kyr”’)) for each terrace 
level. Then, the age of the highest terrace was divided by the total dis- 
tance to the river (66 m) to estimate an overall rate of downcutting (in 
mmkyr’) for the entire valley (Supplementary Information section 15). 
The mean errors associated with these red thermoluminescence and 
pIR-IRSL ages have been propagated through to the final downcutting 
rate, to obtain an error margin of +8%. 


Establishing the fossil context 

The strategy for the Ngandong excavation consisted of locating the 
backfilled edges of the original excavated area along the margins of 
the original Netherlands Indies Survey reserve, using maps produced 
during the 1931-1933 excavations’. Our 115-m? excavation footprint 
encompassed several pits that paralleled the edges of the excavation 
reserve. We identified five lithofacies, comprising the terrace fill at 
Ngandong, and developed a composite cross-section that illustrated 
lateral facies relationships and the context of the bone bed within the 
sequence of deposits (Fig. 2, Supplementary Table 2). Attempts to 
date fossil teeth from facies A and facies C using radiocarbon were 
unsuccessful, as was initial optically stimulated luminescence dating 
of quartz from sediments. However, red thermoluminescence and pIR- 
IRSL dating of quartz and feldspars, respectively, from overbank terrace 
deposits, and U-series dating of fossil teeth and bones proved effective. 


Laser-ablation U-series dating of fossil bone and teeth. Fifteen 
bone samples for U-series dating were collected during the 2010 field 


season (sample numbers 2146, 2178, 2190, 2216, 2269, 2284, 2286, 2291, 
2319, 2330, 2331, 2388, 2404, 2476 and 2481) from facies A and facies C. 
Figure 2 gives the relative positions of these samples. Laser-ablation 
mass spectrometry to measure U-series isotopes along the cross- 
sections of these dense, mammalian long-bone fossils was carried 
out at the Australian National University?’ © (Extended Data Fig. 9). 
Uranium and thorium concentrations were derived from repeated 
measurements of the NBS-610 standard, uranium-isotope ratios from 
the dentine of a rhinoceros tooth from Hexian®. Spot analyses were 
used, which have the advantages over continuous tracks by being ableto 
avoid pores and optimize measurement conditions as well as counting 
statistics for each analysis. The laser was keptin one position for 100s, 
ablating a small pit (132 um in diameter, approximately 50 to 100-~um 
deep) inthe bone. To address U-series age issues related to uncertainties 
in open-system uranium uptake, the ”°Th/*°U and *“*U/*U datasets 
for each bonewere fitted using a diffusion-absorption-decay model®. 


Coupled US-ESR dating of fossil teeth. Bovid molars were collected 
for US-ESR direct dating, and sectioned using a large diamond-blade 
rotating saw and polished to a 100-m surface smoothness. Samples 
NDG-1038, NDG-1163, NDG-2074, NDG-2562, NDG-2566 and NDG-2569 
were first analysed by laser-ablation ICP-MS (LA-ICPMS) quadrupole 
for uranium distribution, to assess the suitability of the samples for 
US-ESR dating. Only the samples NDG-1038, NDG-1163 and NDG-2569 
were found to be suitable, and these samples were prepared following 
a previously developed protocol®™ (Extended Data Fig. 10). Each frag- 
ment was then measured at Southern Cross University on a Freiberg 
MS5000 ESR X-band spectrometer, and irradiated with the Freiberg 
X-ray irradiation chamber. ESR intensities were extracted from the 
merged spectra obtained from angular variation measurements” 
(for an example, see Extended Data Fig. 10b), after correcting for 
baseline, subtraction of isotropic signals and assessment of the con- 
tribution of non-oriented CO, radicals (NOCORs) using previously 
published protocols” (for an example, see Extended Data Fig. 10c). 
Dose-response curves were obtained using the MCDOSE 2.0 soft- 
ware” (Extended Data Fig. 10a). U-series dating was conducted on 
both dentine and enamel at the University of Wollongong, using an 
ESI NW193 ArF Excimer laser coupled to a MC-ICP-MS Neptune Plus to 
calculate the internal dose rate. All age calculations were carried out 
with the US-ESR program”, which uses previously published” dose 
rate conversion factors. 


Modelling of landscape, terrace and excavation chronologies 

To evaluate the uncertainties of the integrated dating approach of 
the landscape, terrace and fossil contexts (Supplementary Tables 6, 
9,10, 12, 13), Bayesian modelling was performed on all independent 
age estimates using the OxCal (version 4.2) software” available at 
https://cl4.arch.ox.ac.uk/oxcal.html (Supplementary Tables 14, 15). 
The analysis incorporated the probability distributions of individual 
ages, and constraints imposed by stratigraphic relationships and the 
reported minimum or maximum nature of some of the individual age 
estimates. Each individual age was included as a Gaussian distribution 
(with mean and s.d. defined by the age estimate and their associated 
uncertainties). The U-series profiling ages on the fossil bone yieldeda 
range of ages (160-60 kyr) and these were incorporated as a uniform 
distribution over this interval. 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Data availability 


The data that support the findings of this study are included in the Sup- 
plementary Information. Additional data are available from the cor- 
responding authors upon reasonable request. 


Code availability 


The Oxcal code used for the Bayesian model in this study is included 
in Supplementary Table 15. 
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Extended Data Fig. 1|See next page for caption. 
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Extended Data Fig. 1| Evolutionary and geomorphical history of region. The 
landscape evolutionary stages that created the Solo River terraces. Drawn from 
refs.?? ona topographical map from the USGS EROS. a, More-than 500-ka 
drainage from the proto-Merapi and Lawu volcanic highlands formed alake or 
lagoon from which the proto-Solo River drained to the south (blue arrow), close 
to the present-day Pacitan region, and another branch flowed to the north of 
Lawu. By at least 1.5 million years ago, the Southern Mountains to the south and 
the Kendeng anticlinorium to the north were slowly emerging, forming the 
Gunung Sewu, and the Kendeng Hills (previously the Randublantung marine 
embayment’), respectively. b, By about 500 ka, the seismically uplifted 
Southern Mountains had blocked the southern exit of the Solo River to the 
ocean, and the area was dominated by trunk streams of the Solo River. 

c, Between about 500 and 316 ka, the Solo River abandons its southern trunk 
stream and extends its northern branch, where it is diverted to the west and 
northeast and carves an initial crossing through the Kendeng Hills to form the 
Solo River gap, and drain into the ocean tothe north of Surabaya. d, Between 


about 316 and 31 ka, the uplifting Kendeng anticlinorium and the drainage from 
the Madiun Basin energized the Solo River, causing incision and forming the 
Solo River sequence of terraces (white parallel lines). e, Present-day Solo Basin 
and known fossil sites on exposed terraces. f, A digital elevation model””, 
comprising a satellite image overlying a topographical map of the section of 
the Solo River system from Kerek village in the south to Sunggun village inthe 
north (USGS Landsat). g, The same digital elevation model, with the 
classification scheme for the Solo River terraces with the upper, middle, lower 
and lowermost terraces identified. This image includes the key terrace sites 
that are sampled in this study; Kerek (upper), Padasmalang (middle), 
Sembungan (lower), Nglebak (lowermost) and Menden (outside of the Kendeng 
Hills, but contemporaneous with the upper and middle terraces), and the key 
fossil site of Ngandong (lower). The white dashed line indicates the limits of the 
Kendeng Hills. The Menden terrace lies outside of this divide, as does the 
westward-bearing Solo River and the site of Trinil. 
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Extended Data Fig. 2| Terrace-site stratigraphy and luminescence sampling. 
a, Map of the Kendeng Hills section of the Solo River from Ngawito Menden, 
displaying the location of the six studied sites. Each site has a smaller inset 
showing the site locations, stratigraphic sections of the strath terraces and 
sampling locations for luminescence dating. b, Ngandong; the inset for the 
Ngandong site is shownin more detail to identify the exact location of the 
sampling site within the context of previous excavations. c, Sembungan 


Kerek 
Village 


Overall downcutting rate 


Solo River 
channel 


200 mmika 1 Key 
| es Section location & Buildings 
I @@@) Sampling locations [S| Marl clasts 
568mmika_ | 5 fl 
I Ps. Reserve Marl bedrock 
Excavated area 
58 mm/ka I 


161 mmika_ | 


excavation I.d, Sembungan excavation II. e, Menden. f, Nglebak. 

g, Padasmalang.h, Kerek. i, Three-dimensional slice of the Solo River valley, 
showing the terrace sequences and resulting downcutting rates (derived from 
21terrace samples (n= 21), mean ages with uncertainties presented at 1o) 
plotted according to elevation and distance away from the river. The associated 
downcutting rates have been presented for each terrace, and for the river 
system asa whole. 
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Extended Data Fig. 3| See next page for caption. 


Extended Data Fig. 3 | Stratigraphy and artefacts of the Sembungan 
excavations. a, Map of the Sembungan terrace, showing the lithology of the 
terrace andthe location of the terrace rise in relation to the Solo River. The 
bottom-right inset shows the locations of the excavations I, II and V. b, A profile 
of the terrace along the A-B transect from A (marked by a red dashed line), 
showing the location of the sand quarry excavations in relation to the river. 

c, The west baulk of excavation I (marked on the inset in A), showing the 
stratigraphic layers and location of the stone artefact concentration (red 
dashed line at the base of the section). Layer J, very coarse sand; layer I, brown 


silt; layer H, cross-bedded coarse pebbly sand; layer G, lenses of siltstone; layer 
F, disturbed; layer E, massive siltstone; layer D, caliche palaeosol. d-i, Stone 
artefacts excavated in situ from Sembungan. d, Obsidian flake. e, Chert flake 
with unifacial retouching to the ventral surface across the proximal end, 
removing the point of force application. f, Chalcedony flake. g,h, Chalcedony 
centripetal cores. i, Quartz crystal cluster. Scale bars, 30 mm.j, A stone tool 

in situ from the Sembungan excavation V. k, Excavation I (inset ina). 

L Excavation II (inset ina). 
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Extended Data Fig. 4| See next page for caption. 


Extended Data Fig. 4 | Mendenstratigraphy and fossils. a, The quarry site of 
Bloraon the Menden terrace near Sunggu (Central Java, Indonesia), tothe north 
of the Kendeng Hills. The red dashed lines depict mega cross-bedding inthe 
fluvial terrace. The vertical blue lines correspond to the stratigraphic section 
shown in b the yellow dashed line depicts the landslide scarp, and the black box 
shows the location of an almost-complete elephant skeleton. b, The stratigraphy 
of the Menden terrace according to logs A and B (marked ona). The upper al, a2 
and blayers represent cross-laminated sands and gravels, and the lower c-g 
layers represent cross-bedded pebbly sandstones. The relative location of the 
elephant skeleton can be seen by the fossil symbol.c, The excavation of the 
Mendenterrace to recover the elephant skeleton (Elephas hysudrindicus)—a 
rare elephant species, endemic to Java. d, Site plan of the partial F. hysudrindicus 
skeleton excavated from the Menden terrace. Thick dashed line indicates 


extension of the excavation. Red dashed line indicates the boundary of the 
quarry wall at the time the fossil was discovered. All fossils recovered south of 
this boundary were found ina landslide at the foot of the quarry wall. 1, partial 
skull; 2, right tusk; 3, left tusk; 4, mandible; 5, cervical vertebrae; 6, thoracic 
vertebrae; 7, lumbar vertebrae; 8, caudal vertebrae; 9, right scapula; 10, right 
humerus; 11, right radius; 12, right carpals; 13, right pelvis; 14, right femur; 15, 
right tibia; 16, right fibula; 17, right patella; 18, right pes (articulated); 19, left 
pelvis; 20, left tibia; 21, left radius; 22, left tarsals; 23, left scapula fragment; 
24, left humerus; pale yellow bones are ribs. e, The right pelvis and femur of the 
elephantin articulation, lying next to the left tibia and fibula and tarsals. f, The 
broken lower jaw of the elephant, with teeth, recovered from the landslide. 

g, Theskull of the elephant in cross-section, as found inthe landslide scar. 
Convoluted sediment layers can be seen below the skull. 
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Extended Data Fig. 5 | History of H. erectus excavations at Ngandong. 

a, Aerial view of Ngandong, created from an unpublished map produced by the 
Geological Survey of the Netherlands Indies, who discovered the site and 
documented the unearthing of 14 H. erectus specimens. b, Extent of the 
27-month-long, 1931-1933 excavations, including H. erectus finds. The 
excavations produced about 25,000 fossils from the Ngandong terrace 
(originally referred to as the 20-m terrace) deposits’. c, Redisplay and 
translation ofan original stratigraphic profile, published by the Geological 
Survey of the Netherlands Indies, showing the first four H. erectus discoveries 


H. erectus find 


MARL BEDROCK EXCAVATION | 
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made in 1931°4. d, Day-of-discovery photograph of Ngandong VI (Ng 7), whichis 
awhole fossil calvaria’. e, Plan-view drawing of the excavation square that 
included Ngandong VI, embedded ina river deposit of very coarse-grained 
volcaniclastic sand, along with marl cobbles and other vertebrate fossils°. 

f, Location of the site in the greater Ngandong area. g, Total datastation 
mapping allowed the 1931-1933 excavated area to be repositioned onthe 
landscape, including the 1931-1933 H. erectus discovery points (Extended Data 
Fig. 6). Panels a-f are redrawn froma previous publication’. 


Extended Data Fig. 6 | Photographs of 2008 and 2010 excavations at 
Ngandong including fossil discoveries. a, View of Ngandong site before the 
2008 excavation, facing northwest. The orange string line marks the extent of 
the 1931-1933 excavations’. b, Collection of samples for optically stimulated 
luminescence dating, from facies B and C in pit A from 2008 (excavation unit 
H10a of the 2010 excavation). c, Bovid scapula and other fossils found in facies 
Cin H10a from 2010. d, Excavations underway in excavation units H10a 


(foreground) and H10c (being dug) in 2010. e, Stratigraphy seeninthe 
northwest wall of excavation unit H10a in 2010. Facies E is seen above the 
remnant of facies A, B, Cand D, whichare visible in the bottom half of this 
section. f, Exposed bone bed in facies A and C in excavation unit GO9 from 2010. 
g, Fossils collected during 2010 excavation. Photographsa, cand eare by O.F.H. 
Allother photographs are by R.L.C. 
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Extended Data Fig. 7 | Fauna from Ngandong recovered during the 2008- sp.), specimen NDG 2569. i, Bovid tooth (Bubalus sp.), specimen NDG1163. 
2010 excavations. a, Cervid antler, cf. Axis sp., specimen NDG 2306. b, Lower j, Artiodactyl canon bone, specimen NDG 2148. k, Artiodactyl hoof, specimen 
right M3, cf. Bos sp., specimen NDG 1134. c, Bovid incisor (Ix), cf. Bubalus sp. NDG 2199. Specimens NDG-1038, NDG-1163 and NDG-2569 (e, g and i) provided 


NDG 1106. d, Bovid cervical vertebra (atlas), specimen NDG 2149.e, Bovidtooth __ results for US-ESR age calculations (Extended Data Fig. 10). All photographs 
(Bubalus sp.), specimen NDG 1131. f, Cervid tooth, cf. Cervus sp., specimen NDG are byJ.-P.Z. 
2074. g, Bovid tooth (Bubalus sp.), specimen NDG 1038. h, Bovid tooth (Bubalus 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | Acomparison of red thermoluminescence and 
pIR-IRSL luminescence data for sample NDG-1. a, Quartz red 
thermoluminescence isothermal decays, showing a natural and regenerative 
decay. b, The dose response of aliquot A of the DAP technique witha D, value of 
185 +53 Gy. The points represent the mean withs.d. uncertainties (too small to 
see at this scale). c, The dose response of the subtracted aliquot B of the DAP 
technique witha D, value of 170 + 53 Gy. The points represent the mean value 
with anerrorasas.d. of the fit (too small to see at this scale). d, Photographs of 
the luminescence emitted by asample from the Ngandong terrace (NDG-1) 
compared toasample from the Wae Raceng terrace in Flores (WR-1). The Flores 
terrace is so bright it has bleached the photographic paper, whereas the 


Ngandong terrace is much dimmer but the red luminescence emissions are 
clearly visible. e, Feldspar pIR-IRSL decays for sample NDG-1, showing the 
natural and regenerative decays. A long stimulation time is required to remove 
all of the pIR-IRSL signal. f, Adose-response curve for the sample NDG-1 witha 
D.value of 150 + 4 Gy. Each dose point represents the mean value withs.d. 
uncertainties (too small to see at this scale). g, Fading tests for the sample 
NDG-1, comparing the fading of the infrared signal at 50 °C (IRs) with the 
fading with the pIR-IRSL signal at 270 °C (pIR-IRSL37.)—demonstrating the 
isolation of a very small fading signal. The points represent the median value 
witha standard error. h, Radial plot of the NDG-1single-aliquot data. 


Extended Data Fig. 9 | U-series-age depth dating of bone. a-l, Fossil bone recovered from facies A and C. Figure 2 gives the locations of the bones. 
recovered from the Ngandong excavations in 2010, displaying the track lines Specimen numbers (NDG) for each bone are listed in white in the top right 
created by the LA-ICP-MS for U-series-age depth modelling. Bones were corner. 
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Extended Data Fig. 10 | See next page for caption. 


Extended Data Fig. 10 |Summary of the US-ESR dating protocol, and results 
for sample NDG-1038. a, Left, spectra of the merged signal increasing with 
irradiation steps. Top right, double saturated exponential dose-response 
curve of NDG-1038, using the MCDoseE 2.0 program”. Bottom right, dose 
equivalent distribution, using the MCDoseE 2.0 program”. b, Angular response 
of the enamel fragment inthe ESR spectrometer at various irradiation steps 
from left to right and top to bottom: natural, 380s, 1,800sand7,200s. 

c, Determination of the NOCOR percentage in the angular response after 
subtracting the natural signal”. d, Uranium-uptake model in both enamel (red) 


and dentine (black) used to calculate the US-ESR dating of NDG-1038. e, US- 
ESRage distribution for NDG-1038 using a previously published program”. 
f-h, Photographs of the three bovid molar teeth. f, NDG-1038. g, NDG-11163. 

h, NDG-2569. These teeth were used for the direct dating by US-ESR of the 
Ngandong bone bed, with indication of U-series measurement locations. Teeth 
were sectioned to expose the various dental tissues. Numbers in white circles 
correspond to the dentine, and numbers in blue circles correspond to the 
enamel measurements. Results for each laser spot can be found inthe 
Supplementary Table 8. 
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Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


O For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection All software used for data collection is commercially available or described in the published literature (citations included). Software used 
includes: Microsoft Excel and MCDoseE (version 2.0) (for ESR dating). 


Data analysis All software used for data collection is commercially available or described in the published literature. Software used includes: Microsoft 
Excel, Isoplot EX (version 3.75) (U-series), MATHLAB USESR (US-ESR), OxCal (version 4.2) (Bayesian modeling), Riso Sequence Pro and 
Analyst. Full OxCal code for the Bayesian modeling is provided in Supplementary Table 15. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The data that support the findings of this study are included in the Supplementary Information. Additional data is available from the corresponding author upon 
reasonable request. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


[_] Life sciences 


[_] Behavioural & social sciences X]_ Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 


Reproducibility 


Randomization 


Blinding 


To determine the age of the hominin site of Ngandong, we considered three interrelated components; a landscape context, a terrace 
context, and a fossil context. Samples were dated using Uranium-series dating of speleothems, red TL, pIR-IRSL, 40Ar/39Ar, U-series 
on fossils, and ESR on fossil teeth. In total, we provide a modelled site age using 52 radiometric ages estimates. The study also 
includes the analysis of fossils and artefacts recovered from the contemporary sites of Ngandong, Sembungan, and Menden. 


To constrain the site of Ngandong we choose samples from a landscape, regional and local perspective. We chose speleothem 
samples from the 5 highest individual cave systems in the Gunung Sewu region. We chose optimum terrace sites from a large terrace 
survey conducted by the GRDC. Finally we determined excavation locations at Ngandong from extensive analysis of previous 
excavation to determine the original in situ locations and conducted multiple excavation baulks in these locations to optimise the 
sampling potential. 


To establish a statistically significant sample size our sampling strategy was to sample from multiple locations in the Ngandong region 
as described above. Five speleothem samples were collected from the highest (therefore the oldest) caves sites in the Gunung Sewu 
—this number allows the oldest age range of speleothem precipitation to be estimated. We collected 21 luminescence and Ar-Ar 
samples from six different terraces in six different locations in the Ngandong area. By sampling this widely, we were able to robustly 
constrain each of the steps of terrace evolution in the Solo Valley. The chronology of the terraces were tested with independent age 
estimates from three different luminesce dating techniques (Red TL heat reset, Red TL light reset and pIR-IRSL techniques) and Ar-Ar 
dating. To constrain the fossil assemblage in the Ngandong terrace were conducted 26 direct dating age estimates of the bone and 
teeth from throughout the terrace across all four of the named Facies (A-D). Samples were chosen according to the weathering state 
of the enamel/dentine or collagen. Particular emphasis was placed on dense, thick bones without cortical sections. Such bones come 
closest to conform to the general assumption of the diffusion adsorption model. By developing a comprehensive sedimentological 
and fossil context, our dating strategy produces a robust chronology for the Ngandong evidence. In regards to the sampling strategy 
for the excavation at Ngandong, we excavated pits on both sides of the 1930's archaeological reserve (5 pit in 2008 and 12 pits in 
2010) across the Ngandong terrace and collected 867 specimens. In respect to the specimens collected by previous excavations, our 
sample is limited to the published literature and a small collection of fossils because the majority of the 1930's collection was lost 
during World War II. 


The Ngandong excavation data was recorded by the members of ITB including palaeontologists and archaeologists over two field 
periods (2008 and 2010). The Solo Terrace data was collected by GSI over a five year period. All data pertaining to the dating 
techniques were collected by the appropriate dating specialists during collection in 2008 and during laboratory analysis from 
2008-2019. 


Excavations and field data collection were conducted between 2005 and 2010 at multiple sites in the Solo River Valley, Central Java, 
Indonesia. 


Samples were excluded if they failed predetermined tests for each dating method. These tests and exclusion criteria are outlined for 
each dating method in the Methods Summary and Supplementary Information. Fragmentary fossils that could not be identified were 
excluded from analysis for the excavations. For the US-ESR dating of three teeth samples NDG-2074, NDG-2562 and NDG-2566 were 
excluded from the dating as no enamel layer offered a uranium concentration <5 ppm, which has been described as the maximum 
acceptable concentration within the enamel layer to obtain a reliable equivalent dose. The ESR dating of the 15 bone samples did not 
yield any meaningful results as porous bones often display considerably more complex U-migration paths. For the luminescence 
dating single-aliquots were rejected according to a rejection criteria devised by cited reference. 


Multiple samples (52) and dating methods (seven) were used to determine a chronology for the Ngandong evidence. For both the 
sedimentological and fossil context independent ages estimates were employed to guarantee that the results are reliable and 
reproducible. Age estimates are consistent and stratigraphically correct between samples. 


Samples were collected for a specific dating method. Fossil specimens were allocated into taxonomical groups. 


Blinding was not relevant to the sample dating or the archaeological and paleontological research. 


Did the study involve field work? Yes [| No 


Field work, collection and transport 


Field conditions 


Field sample collection and excavations took place at multiple location and times in the Solo River Valley, Central Java, Indonesia 
(tropical climate). 
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Location Field sample collection and excavations took place at multiple sites in the Solo River Valley, Central Java, including Ngandong, 
Sembungan, Menden, Padasmalang, Kerek, and Nglebak as well Song Gupuh in Gunung Sewu, East Java (see Figure 1 for 
locations). 


Access and import/export Excavations at Sembungan were undertaken under a recommendation letter from the Provincial Government of West Java to 
the Governor of the Central Java Province No. 070.10/237; a recommendation letter from the latter to the local government of 
the Blora Regency No. 070.10/237; and a research permit issued by the Blora Regency No. 071/457/2005. Excavations at 
Ngandong were carried out with the permission and recommendation of Mr. Wahyu, Head of the Foreign Researchers Licensing 
Secretariat of the State Ministry of Research and Technology (SMRT) that issued research permits 03799/SU/KS/2006, 1718/FRP/ 
SM/VII/2008, and 04/TKPIPA/FRP/SM/IV/2010 for the fieldwork at Ngandong. The excavations at Sembungan and the Menden 
Terrace site in the Blora Regency were carried out under Research Permit Number 2785/FRP/SM/X|/2008. 


Disturbance Excavation procedures and techniques followed national and local regulations. 


Reporting for specific materials, systems and methods 
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We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Palaeontology 


Specimen provenance Fossils were recovered from excavations at Ngandong, Sembungan, and Menden. More details about the provenance of these 
fossils is included in the Supplementary Information. A full list of permits for the excavations can be found in the 
Acknowledgments and in "Access and import/export" above. Permits were obtained for all specimens that were removed from 
Java, Indonesia, for laboratory analysis. 


Specimen deposition Fossils recovered from the 2008-2010 excavations at Ngandong are housed at the Institut Teknologi Bandung, Indonesia. Fossils 
and arefacts from the Sembungan and Menden excavations are under the care of the GSI, Indonesia. 


Dating methods The Methods Summary and Supplementary Information provides detailed descriptions of each dating method. Uranium-series 
dating of the speleothem samples was conducted in the Radiogenic Isotope Facility of The University of Queensland using VG 
Sector 54 thermal ionisation mass spectrometer (TIMS) and a Nu Plasma multi-collector inductively coupled mass spectrometer 
MC-ICP-MS). All luminescence analysis, red TL and pIR-IRSL, was conducted at the "Traps" luminescence dating facility at 
acquarie University in Sydney, Australia using a TL-DA-20 Luminescence reader. Neutron irradiation for 40AR/39Ar dating of 
hornblende crystals was conducted in the cadmium-shielded CLICIT facility at the Oregon State University TRIGA reactor. Laser 
ablation mass spectrometry to measure U-series isotopes along the cross-sections of these dense, mammalian long-bone fossils 
were carried out at the Australian National University. U-series analysis of fossil teeth was conducted at the Australian National 
University. Additional U-series measurements and ESR measurements were undertaken at Southern Cross University. 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 
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The motor cortex controls skilled arm movement by sending temporal patterns of 
activity to lower motor centres’. Local cortical dynamics are thought to shape these 
patterns throughout movement execution’ *. External inputs have been implicated in 
setting the initial state of the motor cortex”®, but they may also have a pattern- 
generating role. Here we dissect the contribution of local dynamics and inputs to 
cortical pattern generation during a prehension task in mice. Perturbing cortex to an 
aberrant state prevented movement initiation, but after the perturbation was 
released, cortex either bypassed the normal initial state and immediately generated 
the pattern that controls reaching or failed to generate this pattern. The difference in 
these two outcomes was probably a result of external inputs. We directly investigated 


the role of inputs by inactivating the thalamus; this perturbed cortical activity and 
disrupted limb kinematics at any stage of the movement. Activation of 
thalamocortical axon terminals at different frequencies disrupted cortical activity 
and arm movement ina graded manner. Simultaneous recordings revealed that both 
thalamic activity and the current state of cortex predicted changes in cortical activity. 
Thus, the pattern generator for dexterous arm movement is distributed across 
multiple, strongly interacting brain regions. 


Reaching, grasping and object manipulation have a central role inthe 
lives of mammals with prehensile forelimbs. Motor cortex is a key brain 
hub involved inthe control of skilled movements of the arm and hand. 
In primates and rodents, lesions of motor cortex can impair dexterity’ °, 
stimulation of cortical neurons evokes movements!” *, the activity 
of these neurons is closely linked to movement parameters!*~, and 
optogenetic perturbations have a range of effects on different forelimb 
behaviours"”””* 8, Recent work has emphasized that the motor cortex 
is a dynamical system in which neural firing rates evolve over time as 
a result of both the local cortical dynamics and external inputs from 
other areas, suchas the thalamus and other cortical regions? (Fig. 1a). 
The dynamical principles governing the cortical system, however, 
remain largely unknown. 

External inputs to motor cortex carry sensory information about the 
arm” as well as signals from the cerebellum, basal ganglia and higher 
cortical areas. The inputs are known to be important for movement 
preparation®**° * and sensory-based corrections”, but how do these 
inputs work together with local dynamics to produce motor corti- 
cal output during the execution of an unperturbed movement? One 
possibility is that once motor cortex has been set to the appropriate 
initial condition by external input, strong local dynamics generate 
the output during execution, whereas external inputs are weak; that 
is, the motor cortical dynamical system is largely autonomous dur- 
ing movement* (Fig. 1b, left). However, previous studies have shown 
movement-locked patterns in many brain areas providing input to 


motor cortex, coherence between motor cortex and these regions*”», 


and disrupted cortical activity following cerebellar cooling® or in ani- 
mal models of Parkinson’s disease”. A second possibility is that exter- 
nal inputs are needed to maintain the motor cortical dynamics ina 
pattern-generating regime, but the precise temporal pattern of these 
inputs is not critical to producing the correct output pattern*®. A third 
possibility is that local dynamics within motor cortex must receive a 
strong, time-varying input pattern to produce the appropriate output 
(Fig. 1b, right). Although firing rates in thalamus are modulated ona 
kinematic timescale”, the hypothesis that this temporal modulation 
is required for the generation of time-varying output within cortex has 
not been tested directly. Here we dissect the contribution of external 
inputs to cortical dynamics controlling dexterous behaviour by com- 
bining optogenetic perturbations of motor cortex and thalamus with 
high-density electrophysiology and movement tracking. 


Recovery from perturbed initial states 


To study cortical dynamics during dexterous movement, we trained 
mice to perform a reach-to-grasp movement, which we have previ- 
ously shown to depend on motor cortex” (Fig. Ic), and extracted arm 
kinematics and behavioural waypoints from high-speed video (Fig. 1d, 
Supplementary Video 1). Using silicon probes, we recorded activity 
from neural ensembles in contralateral forelimb motor cortex (Fig. 1d). 
Whereas the responses of individual cells were highly consistent across 
trials, we observed a wide diversity of patterns across neurons, includ- 
ing increases, decreases, and multi-phasic responses (Fig. le, f). These 
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Dynamical systems model of motor cortical control of reaching 
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patterns are consistent with previous findings in the primary motor 
cortex of nonhuman primates!"*” and rodents’* ” performing dexter- 
ous behaviours. 

If motor cortex were largely autonomous during movement execu- 
tion (Fig. 1b, left), then the initial pre-movement state of cortex, which 
is set by external inputs, should determine the subsequent evolution of 
activity during execution. Thus, inthis model, if cortex were perturbed 
to an aberrant state, it would need to be reset to the appropriate initial 
condition by external inputs before the initiation of movement, and this 
reset would probably increase the animal’s reaction time. To examine 
how cortex recovers from aberrant initial states, we performed three 
cell-type-specific manipulations of the cortical network by activating 
either inhibitory interneurons (VGAT-ChR2-eYFP mice), intratelence- 
phalic neurons (T1x3-Cre x Ai32) or pyramidal tract neurons (Sim1-Cre 
x Ai32). Each of these perturbations had a large effect on cortical activ- 
ity (Extended Data Fig. 1a, b) and blocked the initiation of reaching 
(Fig. 1g, Extended Data Fig. 1c, d). Following the release of each motor 


Fig. 1| Motor cortex as a dynamical system controlling the arm. a, The 
dynamical systems model for motor cortical control of reaching (see Methods). 
b, Left, generation of firing-rate patterns r(¢) if motor cortex were driven by 
strong recurrent dynamics A(r(d), with external inputs u(t) exerting a limited 
influence and not necessary for pattern generation. Right, generation of firing- 
rate patterns if motor cortex were dependent on strong temporally patterned 
external inputs, u(f).c, Experimental setup. Head-fixed mice reached fora 
pellet of food following an acoustic cue during recording and optogenetic 
perturbation of cortical activity. d, Raw video, electrophysiological recording 
and mouse behaviour ina single trial. Three-dimensional hand trajectories and 
the timing of each waypoint in the behavioural sequence were extracted from 
video using computer vision methods. Sup, supination. e, Spike raster plots 
and peri-event time histograms for four example neurons recordedind, 
centred onlift. Numbers indicate the maximum value on the yaxis, in spikes 
per second. f, Average z-scored firing rates (FR) and mean firing rates for all 
motor cortical neurons (n=19 mice, n=39 sessions and n= 843 neurons). 
During prehension, most neurons exhibited increases (39%) or decreases (37%) 
in spike counts around lift (two-sided rank sum test with Benjamini-Hochberg 
correction, g< 0.05). g, Distribution of lift times on control (yellow), laser plus 
cue (blue) and laser-only (magenta) trials for VGAT-ChR2-eYFP mice (n=5 mice, 
n=7 sessions). Cue starts at time O, and the blue bar indicates the laser-on 
epoch. h, Neural population activity from lift -100 ms to lift +425 msin control 
(yellow) and post-laser (blue) reaches in VGAT-ChR2-eYFP mice, obtained using 
trial-averaged principal component analysis (PCA) (n=4 mice, n= 6 sessions 
andn=144 neurons). 


cortical perturbation, we frequently observed kinematically normal 
post-laser reaches (Extended Data Fig. 1c-f). The reaches occurred 
with a shorter reaction time than in control trials following the VGAT 
and TIx3 perturbations (Extended Data Fig. le), contrary to the predic- 
tions of the autonomous model, and they also occurred following laser 
stimulation in the absence of a cue (Fig. 1g, Extended Data Fig. 1c—e).In 
post-perturbation reaches, the neural population activity, estimated 
using dimensionality reduction, did not return to the initial state 
observed in control trials, but immediately generated patterns that 
largely recapitulated those observed in control trials (Fig. 1h, Extended 
Data Figs. 1g, 2, Supplementary Videos 2, 3). Furthermore, it was pos- 
sible to decode the post-laser hand trajectories using a decoder trained 
oncontrol trials (Extended Data Fig. 3). This suggests that cortex does 
not need to returntoa specific pre-movement state, as required by the 
autonomous model (Fig. 1b, left). 

In the dynamical systems view, the contribution of local dynamics 
depends only onthe current state of cortex. We observed that silenc- 
ing motor cortex in VGAT-ChR2-eYFP mice fixes motor cortex to a 
constant state across trials (Extended Data Fig. 4a). Thus, unless the 
network is extremely sensitive to small differences in initial state** (as 
inachaotic system), trial-to-trial variability in neural activity following 
the release of the laser will reflect variability in external inputs. When 
we compared trials on which a post-laser reach occurred with trials 
with no reach (Fig. 2a), we found that the two trial types started in the 
same initial state, but rapidly diverged after the release of the laser 
(Fig. 2b, c, Extended Data Fig. 5c), suggesting a difference in external 
input between reach and no-reach trials (Fig. 2d). We estimated this 
difference in inputs by subtracting the firing-rate derivatives of the 
two conditions (Fig. 2e; see Methods). The result of this experiment 
suggests that external inputis critical to producing the correct reach- 
ing pattern. 


Input is required for pattern generation 

Ifthe motor cortex requires external input throughout movement exe- 
cution, then blocking or interrupting the input pattern should perturb 
both motor cortical activity and arm kinematics. To test this directly, 
we implanted optical fibres above motor thalamus contralateral to the 
arm in VGAT-ChR2-eYFP mice. This enabled us to activate inhibitory 
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Fig. 2| Divergence of neural trajectories from the same initial state. 

a, Average hand trajectory for trials with (blue) and without (red) post-laser 
reaches in VGAT-ChR2-eYFP mice (n=4 mice and n=6 sessions). b, Neural 
population activity aligned to the end of the laser for trials with (blue) and 
without (red) a post-laser reach (n=128 neurons). Time limits are 250 ms before 
the end of the laser to 250 ms after the end of the laser. Black dot indicates 
baseline activity before the start of the trial. c, Spatial (left) and neural (right) 
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distance to target, centred onthe end of the laser, for trials with (blue) and 
without (red) post-laser reaches. d, Schematic illustrating that a divergence in 
neural trajectories from the same initial state implies a difference in external 
inputs. e, Estimated difference in external inputs between reach and no-reach 
conditions (see Methods). The divergence of the traces shortly before the end 
of the laser is owing to smoothing (Extended Data Fig. 5a—d). 
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Fig. 3 | External inputs are required for the motor cortical pattern during 
reaching. a, Experimental schematic. Placement of fibres over motor cortex 
and thalamus (thal). b, Distribution of lift times on control (yellow) and 
thalamicinactivation (green) trials; n=3 mice (VGAT-ChR2-eYFP) andn=7 
sessions. Right inset shows the probability of a lift within the first 500 ms 
following the cue for control and thalamus inactivation trials. c, Hand position 
inthe upper direction centred onlift on control trials (light yellow) and mid- 
reach thalamic inactivation trials (black; green indicates laser on) for asingle 
dataset. Dots indicate the start of the laser. Data from all mice (n=4 mice and 
n= 6 sessions) are shown in Extended Data Fig. 6. d, Lift times for control trials 
(yellow), cortical inactivation (blue) and sequential inactivation of cortex 
(ctx) and thalamus (green); n=3 mice (VGAT-ChR2-eYFP) andn=4 sessions. 

e, Average firing rate z-scores for all recorded neurons under inactivation of 
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cortex alone (left) and sequential inactivation of cortex and thalamus (right); 
n=3mice,n=4 sessions andn=127 neurons. f, Population activity following 
the end of cortical inactivation for trials with cortical inactivation only (blue) 
and inactivation of thalamus after cortex (green). Plotting limits start 500 ms 
before the end of cortical inactivation and finish 500 ms after the cortical 
inactivation (blue trace) and 500 ms after the thalamic inactivation (green 
trace). The divergence of the trajectories shortly before the end of cortical 
inactivation results from smoothing (Extended Data Fig. Se-h), and inhibitory 
interneurons were excluded. g, Schematic illustrating that the divergence from 
the cortex-inactivated state reveals differences in input. h, Estimated 
difference in external inputs following the end of cortical inactivation between 
thalamus inactivated (u_,,(¢)) and not inactivated (u,,,(0) conditions. The 
difference is given by u,,,,(f) - u_y,(0. 
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Fig. 4 | Modification of the temporal pattern of inputs perturbs cortical 
activity and movement. a, Experimental design. Mice expressing ChR2 in 
thalamic neurons performed the task during recording of cortical activity and 
optogenetic stimulation of thalamocortical terminals. b, Left, example average 
hand trajectory on control trials and stimulation trials with stimulation 
frequencies of 4,10 and 40 Hz froma single experimental session. Data fromall 
mice (n=3 mice and n=6 sessions) are shown in Extended Data Fig. 9. Time 
limits are cue -250 ms to cue +1,000 ms. Right, average neural activity (first two 
principal components) in the same session. Dots indicate the end of the 
trajectory at cue +1,000 ms. c, Left, average Euclidean distance from the hand 
position in each stimulation condition to the control hand position at the same 
time point. Each curve shows a single experimental session (n=3 mice andn=6 
sessions). Middle, Euclidean distance from the neural state in the stimulated 
conditions to the neural state in the control condition. Right, distance from 
hand trajectory to control versus distance from the neural trajectory to 
control. 


terminals in this region, which has been shown to suppress activity 
in the projections to motor cortex”! (Fig. 3a). Thalamic inactivation 
at the start of the trial blocked the initiation of coordinated reaching 
(Fig. 3b), and inactivation in the middle of the reach interrupted the 
progression of the hand to the target (Fig. 3c, Extended Data Fig. 6). 
These results demonstrate that external inputs are required to control 
the hand throughout the entire movement. 

To isolate the effect of inputs from local dynamics, we first set the 
motor cortical network—and thus the contribution of local dynamics— 
to the same initial state by silencing cortex in VGAT-ChR2-eYFP mice. 
Then, when we removed the motor cortical inactivation, we allowed 
the network to recover in some trials, but immediately silenced the 
thalamus in other trials. This thalamic inactivation blocked reaching 
following the removal of motor cortical suppression (Fig. 3d). Thalamic 
inactivation did not act by merely silencing motor cortical spiking; fir- 
ing rates during this epoch fluctuated extensively (Fig. 3e, Extended 
Data Fig. 7c). The mouse frequently reached to the target after the 
removal of thalamic inactivation (Fig. 3d, Extended Data Fig. 7a). These 
post-thalamic-inactivation reaches were generated by the same neural 
pattern that drove reaching in control trials and post-motor-cortical- 
inactivation trials (Extended Data Fig. 7b). Population activity with and 
without thalamic inactivation began with the same initial state, but 
rapidly diverged after the end of motor cortical suppression (Fig. 3f, 
Extended Data Fig. 5g, Supplementary Video 4). When thalamus was 
inactivated, the cortical trajectory exhibited a brief transient and then 
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Fig. 5 | Relationship between population activity in motor thalamus and 
motor cortex. a, Experimental setup. Spiking activity was simultaneously 
recorded in motor cortex witha four-shank, 64-channel probe and in thalamus 
with a384-channel Neuropixels probe (n=3 mice and n=3 sessions). The 
thalamic region projecting to motor cortex was identified by optogenetic 
stimulation of thalamocortical terminals. b, Population trajectories for 
thalamus (left, green) and cortex (right, magenta) obtained with trial-averaged 
PCA.c, Single-trial population activity in thalamus (top) and cortex (middle), 
along with hand position (bottom). d, Goodness-of-fit of regression models 
(coefficient of determination, R?). The dependent variable was the derivative of 
the cortical population state for the first three principal components. The 
independent variable was the cortical state (magenta), the thalamic state 
(green) or both (grey) for the first N principal components, where Nwas varied 
between 1and 10. The top row shows the goodness-of-fit for trial-averaged 
data, and the bottom for single-trial data. 


converged to a new fixed point, which could reflect the influence of 
local dynamics. Because the difference between the two trial types 
was whether or not we silenced thalamus, the rapid divergence of the 
corresponding trajectories from the cortex-inactivated state resulted 
from differences in external inputs, rather than extreme sensitivity 
to initial conditions (Fig. 3g). We estimated the difference in external 
inputs between the two conditions (see Methods), revealing a strong 
influence of inputs on the first principal component of neural activity 
(Fig. 3h). 
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Graded disruption of the input patterns 


Our thalamicinactivation results showed that inputs are necessary to 
produce the correct cortical output. We next investigated whether a 
precise temporal pattern in these inputs is required (Fig. 1b, right). To 
address this question, we corrupted the input pattern by expressing 
ChR2 in thalamic neurons and stimulating thalamocortical terminals at 
frequencies of 4 Hz, 10 Hz and 40 Hz while recording from motor cortex 
during reaching (Fig. 4a). Stimulation of the terminals entrained spiking 
activity in motor cortex (Extended Data Fig. 8a), but produced relatively 
small changes in mean postsynaptic firing rates (Extended Data Fig. 8b) 
compared with the changes induced by direct stimulation of cortical 
neurons (Extended Data Fig. 1b). Stimulation partially blocked the 
initiation of reaching, and the blocking effect was dependent on the 
frequency of the stimulus (Extended Data Fig. 8d; P=6.7 x10°°, F=18.0; 
one-way ANOVA). At 4 Hz stimulation, the mouse was able to reach to 
the pellet, and the hand and neural trajectories largely recapitulated the 
control trajectories (Fig. 4b, Extended Data Fig. 8c, 9, Supplementary 
Video 5), although the hand trajectory was transiently perturbed at the 
pulse times (Extended Data Fig. 9a). At 10 Hz stimulation, the mouse 
often initiated amovement, but was usually unable to reach to the pel- 
let, andinsome cases, the limb oscillated at the stimulation frequency. 
At 40 Hz stimulation, the initiation and execution of reaching were 
severely impaired. The neural and hand trajectories were more differ- 
ent from the control trajectories at higher stimulation frequencies 
(P=0.0040 and 0.0052, F=8.2 and 7.6, respectively, one-way ANOVA; 
Fig. 4c, left and centre), and the magnitude of the difference was corre- 
lated for the neural and hand activity (Spearman’s p= 0.94, P=1.7 x10; 
Fig. 4c, right). Thus, our data suggest that the temporal pattern of inputs 
directly influences cortical activity and hand kinematics. 


Thalamocortical coupling 


We have shown that time-varying thalamic inputs are required for corti- 
cal pattern generation. We next investigated the contribution of these 
inputs relative to local dynamics. We simultaneously recorded activity 
in motor cortex and motor thalamus using a384-channel Neuropixels 
probe* and verified the targeting of thalamus with optogenetic tagging 
and histology (Extended Data Fig. 10a—c). Most thalamic neurons were 
modulated around movement onset (Extended Data Fig. 10d; increases 
inn=73 out of 98, decreases in n=11 out of 98), consistent with previous 
reports in both rodents?” and nonhuman primates*°*!”, Population 
activity in thalamus and cortex exhibited strong modulation in trial- 
averaged (Fig. 5b) and single-trial activity (Fig. 5c). 

Inthe dynamical systems model, the derivative of the cortical state is 
afunction of both external input and the current cortical state (Fig. 1a). 
Thus, we numerically differentiated the cortical population activity 
(single-trial or trial-averaged) and regressed it on the cortical state, the 
thalamic state, or both. If local dynamics were strong during movement 
execution, then the model using the cortical state as the independ- 
ent variable should provide a better fit. However, if inputs were much 
stronger than local dynamics, then the model using thalamic state as 
the independent variable should provide a better fit. We found that 
the cortical derivative was predicted by both variables, with either 
thalamic state or cortical state providing a better fit, depending on 
the individual mouse (Fig. 5d). This suggests that the cortical state 
does not evolve as a purely autoregressive process or simply integrate 
thalamic input; rather, both local dynamics and inputs contribute to 
generating the cortical pattern. 


Discussion 


Studies in nonhuman primates have emphasized the role of external 
inputs in setting up a specific initial state in motor cortex that allows 
the appropriate activity pattern to unfold during the execution of a 
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movement’ °, though this state may be bypassed under certain condi- 


tions**. Under this view, the subsequent evolution of cortical activity 
may be strongly guided by autonomous dynamics, unless the cortex 
needs to correct for errors or perturbations‘. After we optogenetically 
set cortex to aberrant initial states, it rapidly produced the appropri- 
ate pattern for reaching without returning to the initial state observed 
on control trials, provided the inputs were intact. This suggests that 
a specific initial state is not required for cortical pattern generation. 
Contrary to the predictions of the autonomous model (Fig. 1b, left), 
silencing or stimulating thalamus severely disrupted cortical pattern 
generation and arm movement. Thus, temporally patterned inputs are 
critical for producing the cortical output pattern during movement 
execution (Fig. 1b, right). 

Our data reveal experimental conditions in which the temporal pat- 
tern of external inputs drives the pattern in motor cortex, and in which 
this output pattern is required to skilfully control the arm and hand. 
These findings complement previous studies, which have suggested 
that dexterous behaviours that require precise coordination of the 
fingers and interaction with objects, such as grasping’ *"*”””°, may be 
one class of cortically dependent movement. The execution of other 
forelimb behaviours—such as locomotion”, pulling a grasped lever™* 
or timing a sequence of lever taps*8—does not appear to require cortex. 
The source, modality, and function of inputs may also differ across 
species; it is possible, for instance, that long-range inputs from other 
cortical regions may be more important in primates, which havea highly 
differentiated cortical network for planning and executing reaching and 
grasping”’. Species differences in sensorimotor delays*’ and moment 
of inertia of limb segments, which are related to body size, could also 
affect the timescale on which peripheral inputs influence cortex. 

We have shown that the cortical pattern that controls reaching can be 
produced only iftemporally patterned inputs are maintained through- 
out the entire movement. Thus, while motor cortex is a bottleneck for 
descending motor commands, its activity patterns during movement 
are crucially moulded by its embedding within a vast and distributed 
network, which incorporates muscles, the sensory periphery, subcorti- 
cal regions and other cortical areas. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded 
to allocation during experiments and outcome assessment. 


Dynamical systems model for motor cortical control of reaching 
In this model??, depicted in Fig. 1a, the firing rates of motor cortical 
neurons, r(t), change as a result of two distinct influences. First, the 
local architecture intrinsic to motor cortex imposes a change, h(r(t)), 
based onthe current firing rates. Second, brain regions outside motor 
cortex provide external input, u(t). This input is not identical to the 
firing rates of the neurons in upstream brain regions; rather, it repre- 
sents the effect those upstream firing rates have on the firing rates of 
postsynaptic neurons in motor cortex. The firing rates evolve according 
tor’(d =hA(r(d) + u(0). These firing rates control the muscle activation, 
m(¢) = G(r(0), through circuits in the lower motor centres, including 
the spinal cord. Inturn, the muscles change the positions and velocities 
of the joint centres, x(¢), through a function describing the musculo- 
skeletal mechanics: x’(t) = F(m(t),x(t)). Delayed sensory feedback from 
the arm ascends into the brain and influences the external inputs u(0), 
closing the loop. 

The additive interaction between inputs and local dynamics makes 
the model intuitive and tractable for estimating the difference in exter- 
nalinput contributions across experimental conditions (Figs. 2e, 3h). 
However, it is likely that the network dynamics are nonlinear. That 
is, the evolution of cortical activity over time might be described by 
r’(t) = @(r(O,u(O), and thus local connectivity in motor cortex may 
amplify or distort simple changes in input, including changes in tonic 
drive. Modelling such complex, nonlinear interactions will require 
powerful new analysis tools, such as those that use deep learning to 
capture cortical dynamics”. 


Behavioural task and video analysis 

Mice of either sex aged 8-25 weeks were fitted with head posts, food 
restricted, and trained to reach for food pellets, as described previ- 
ously”®. All data in this manuscript, including those from the behav- 
ioural experiments, are previously unpublished. WaveSurfer (http:// 
wavesurfer.janelia.org/) was used to control the behavioural stimuli. 
Video of the behaviour was recorded at 500 Hz using BIAS software (IO 
Rodeo, available at https://bitbucket.org/iorodeo/bias) and two high- 
speed cameras (PointGrey, Flea3), which were calibrated to allow 3D 
triangulation of hand position (Caltech Camera Calibration Toolbox 
for Matlab; http://www.vision.caltech.edu/bouguetj/calib_doc/). Two 
types of information were extracted from video: ethograms labelling 
the frames in which lift, hand open, grab, supination, hand at mouth, 
and chew occurred, obtained using the Janelia Automatic Animal Behav- 
ior Annotator (https://github.com/kristinbranson/JAABA), and the 
position of the hand in space, obtained using the Animal Part Tracker 
(APT) (https://github.com/kristinbranson/APT). All procedures were 
approved by the Institutional Animal Care and Use Committee at Janelia 
Research Campus (protocols 13-99, 16-139 and 19-177). 


Automatic behaviour characterization 

Using an adaptation of the Janelia Automatic Animal Behaviour Annota- 
tor (JAABA), we trained automatic behaviour classifiers that input infor- 
mation from the video frames and output predictions of the behaviour 
category-lift, hand-open, grab, supination, at-mouth, and chew. We 
adapted JAABA to use Histogram of Oriented Gradients” and Histo- 
gram of Optical Flow” features derived directly from the video frames, 
instead of features derived from animal trajectories. The automatic 
behaviour predictions were post-processed as described previously”® 
to find the first lift-hand-open-grab and supination-at-mouth-chew 
sequences. For the mid-reach thalamic perturbation experiments 
(Fig. 3c, Extended Data Fig. 6), we used the last lift detected before 


laser onset for aligning data. Tracking of hand position was performed 
using the APT software package (https://github.com/kristinbranson/ 
APT). Hand position was annotated manually for aset of training frames, 
and the cascaded pose regression™ algorithm was used to estimate the 
position of the hand in each remaining video frame. For the thalamus 
recordings and stimulation (Figs. 4,5), APT was used with the Deep- 
LabCut algorithm®, and lifts were detected using threshold crossings 
of the upward hand velocity (threshold of 75 mms). 


Electrophysiological recordings in motor cortex 

Neural recordings were performed using the Whisper acquisition sys- 
tem (Janelia Applied Physics and Instrumentation Group) and 64-chan- 
nel silicon probes (NeuroNexus A4 x 16-Poly2-5 mm-23 s-200-177-A64 
or Janelia 4 x 16 probes). These probes consisted of four shanks with 16 
contacts at the tip of each, over a depth of 345 ppm (NeuroNexus) or 320 
pum (Janelia probes). The electrode contacts were coated with PEDOT 
to lower their impedance, and in some cases, the tip of the probe was 
sharpened witha spinning hard disk to enable easier insertion. Onthe 
day before the experiment, a small craniotomy was made over motor 
cortex contralateral to the limb, anda stainless steel reference wire was 
implanted in visual cortex. During the recording session, the probe tips 
were positioned at approximately bregma +0.5 mm, 1.7 mm lateral, 
and slowly lowered to a depth of -900 pm from the cortical surface, 
and asilicone elastomer (Kwik-Sil, World Precision Instruments) was 
applied to seal the craniotomy. At the end of the session, the probe 
was removed, and the craniotomy was re-sealed with silicone to allow 
asubsequent session on the following day. Signals were amplified with 
a gain of 200 and digitized to 16 bits at 25-50 kHz, and spike sorting 
was performed with JRClust (https://github.com/JaneliaSciComp/ 
JRCLUST)*. 


Optogenetic manipulations 

Cell-type-specific expression of ChR2 for cortical perturbations was 
achieved by either using VGAT-ChR2-eYFP mice” expressing ChR2 in 
inhibitory neurons (Slc32a1-COP4*H134R/eYFP, The Jackson Labora- 
tory), or by crossing a Cre driver line to a Cre-dependent ChR2 reporter 
mouse, Ai32*° (Rosa-CAG-LSL-ChR2(H134R)-eYFP-WPRE, The Jack- 
son Laboratory). Experiments were performed in VGAT-ChR2-eYFP 
(n=13), Tg(TIx3-Cre)PL56Gsat x Ai32 (n = 3), Tg(Sim1-Cre)KJ18Gsat X 
Ai32 (n=3), or Tg(Rbp4-Cre)KL100Gsat x Ai32 (n= 2) mice. Rbp4-Cre 
x Ai32 mice were used for control trials only, as they provide poorer 
marking of pyramidal tract neurons than Sim1-Cre x Ai32. Experiments 
were attempted in three additional mice (VGAT-ChR2-eYFP, n =2, and 
Tg(Sim1-Cre)KJ18Gsat x Ai32, n=1), but were aborted owing to the poor 
quality of the electrophysiological signals. An optical fibre (200 pm or 
400 um, NA 0.39, Thorlabs) was coupled to a 473-nm laser (LuxX 473-80, 
Omikron Laserage) and positioned 2-4 mm over motor cortex in the 
head fixation apparatus, as described previously”®. Five VGAT-ChR2- 
eFYP mice were implanted with an optical fibre over motor thalamus 
(bregma -—1.1mm, lateral 1.3 mm, depth3.3 mm). A blue light-emitting 
diode array was directed at the animal’s eyes throughout the session in 
order to mask the laser stimulus. Three trial types were used: control 
trials, in which the cue was presented with no laser stimulation; laser + 
cue trials, in which both were presented; and laser-only trials, in which 
the laser was turned on without a cue or food administration. A two- 
second laser stimulus (40 Hz sine wave) was initiated synchronously 
with the cue for VGAT-ChR2-eYFP mice, or 200 ms before cue onset for 
TIx3-Cre x Ai32 and Sim1-Cre x Ai32 mice. Laser power was calibrated to 
the minimum level necessary to block reaching in probe experiments in 
the final days of training; this ranged from 10-50 mW atthe fibre tip for 
VGAT mice, and 0.5—6 mW for TIx3 and Sim1 mice. In mid-reach inter- 
ruption experiments, a region of the video frame between the average 
lift and hand open locations was identified using BIAS software, anda 
contrast change in this region was used to open the laser shutter for 2s. 
All optogenetic perturbations were unilateral, on the side opposite the 


reaching arm. For mice with simultaneous recordings in motor cortex 
and thalamus (n = 3), ChR2 expression was driven in thalamus either 
by injecting a ChR2 reporter mouse Ai32 with AAV-2/9-Syn-Cre (n= 2), 
or by injecting a Vglut2-ires-Cre mouse (The Jackson Laboratory) with 
AAV-2/5-EFla-DIO-hChR2(H134R)-mCherry (n=1). For mice with cortical 
recording during the stimulation of thalamocortical terminals (n=3), 
we used Vglut2-ires-Cre mice (The Jackson Laboratory) with AAV-2/5- 
EFla-DIO-hChR2(H134R)-mCherry injected into motor thalamus, and 
stimulus trains consisting of 5 ms pulses at 4 Hz, 10 Hz or 40 Hz for 2s 
were initiated synchronously with the cue. 


Simultaneous electrophysiological recordings in motor cortex 
and thalamus 

Mice expressing ChR2 in motor thalamus were head-fixed and trained 
onthe reaching task. On the day before recording, craniotomies were 
performed to allow access to motor cortex (bregma +0.5 mm, 1.7 mm 
lateral) and thalamus (bregma —2.3 mm, 1.3 mm lateral), and were sealed 
with silicone elastomer (Kwik-Sil). On the day of recording, a four-shank, 
64-channel silicon probe was inserted vertically into motor cortex at 
a depth of 800 pm, and a 960-site, 384-channel Neuropixels probe* 
(Option 3A) was inserted with a caudal tilt of 23 degrees from vertical 
to target thalamus. Once the thalamic probe was in position at ~4.6 
mm fromthe surface, a pulse train of 473 nm light (40 Hz, 5 ms pulses, 
2-12 mW) was applied through a fibre-coupled laser positioned over 
motor cortex. This pulse train stimulated thalamic terminals in motor 
cortex, and enabled identification of a region of the Neuropixels probe 
positioned in athalamic region projecting to motor cortex (Extended 
Data Fig. 10a). The position of the Neuropixels probe was further veri- 
fied by examining the probe track in a histological section (Extended 
Data Fig. 10b). The behavioural task was runas in the cortical recording 
experiments. Spike sorting was performed with Kilosort2® (https:// 
github.com/MouseLand/Kilosort2) and Phy GUI (https://github.com/ 
kwikteam/phy). 


Data analysis 

Peri-lift firing rates. For each neuron, lift-centred spike trains were 
smoothed with a Gaussian kernel (o= 50 ms) and averaged across tri- 
als and z-scored. Lift modulation was assessed using a rank sum test 
comparing the raw spike counts in a500-ms window centred at lift 
+200 ms with counts ina 500-ms window centred at lift -750 ms. Mul- 
tiple comparisons were corrected using the Benjamini—-Hochberg false 
discovery rate framework (q< 0.05), and all statistical tests inthe study 
were two-sided (Fig. 1f, Extended Data Fig. 10d). 


Trial-averaged PCA. Peri-lift firing rates were extracted by smooth- 
ing the spike trains with a Gaussian kernel (o = 25 ms for Figs. 2b, 3f, 
o=100msin Fig. 4b and Extended Data Fig. 9c, and o=50 msin the other 
analyses), z-scored, and averaged within each trial type for each neu- 
ron. The window used was -100 ms to 425 ms around each lift (Fig. 1h, 
Extended Data Fig. 1g), -100 ms to 350 ms around lift (Fig. 5b, Extended 
Data Fig. 7b), -250 ms to +250 ms from the end of cortical inactiva- 
tion (Fig. 2b), -500 ms to +500 ms of the end of the cortical inactiva- 
tion (Fig. 3f, blue), or -500 ms from the start of cortical inactivation 
to +500 ms from the end of the thalamic inactivation (Fig. 3f, green). 
Interneurons that increased their firing during cortical silencing were 
excluded in Figs. 2b, 3f, and one session in which no post-laser reaches 
occurred was excluded in Fig. 2. Principal component coefficients were 
fit using control trials only for lift-centred analyses (Fig. 1h, Extended 
Data Fig. 1g), or cortex inactivation only (Fig. 3f), and scores were then 
extracted for all trial types. For the lift-aligned PCA analyses, acontrol 
trial was included if lift and grab occurred within 500 ms following 
the cue, and a laser trial was included if lift and grab occurred within 
500 ms of the end of the laser. We excluded one VGAT dataset and one 
Sim] dataset from this analysis, as they had only 0 and 1 post-laser trial 
meeting this criterion, respectively. In order to examine the time of the 


divergence of neural trajectories from the cortex-inactivated state, we 
also smoothed the laser-aligned data with causal filters that used spikes 
only from the past (Extended Data Fig. 5). This resulted in trajectories 
that were causal, but not differentiable (as required by the analyses 
in Figs. 2e, 3h), so we used Gaussian smoothing for all other analyses 
(Figs. 1h, 2b, 3f, 4b, 5b, Extended Data Figs. 1g, 2, 5, 7b, 9c). 


Single-trial PCA. Spike trains were smoothed with a Gaussian kernel 
(o=100 ms) andz-scored using the baseline mean and standard devia- 
tion, and principal components were extracted from trial-averaged, lift- 
aligned data. Z-scores from individual trials were then projected onto 
these components to obtain single-trial population activity (Fig. 5c, 
Extended Data Fig. 8c, lower two rows). Trial-averaged data in Fig. 4b 
(right) and Extended Data Fig. 9c were obtained by averaging these 
single-trial estimates (Fig. 5c, Extended Data Fig. 8c). 


Gaussian-process factor analysis and neural distance. Neural popu- 
lation activity was reduced to a five-dimensional latent variable space 
using Gaussian-process factor analysis (GPFA)® (bin size 20 ms). A 
region of the dataset in which the recordings were stable was selected 
by finding the time interval and subset of neurons that maximized the 
quantity (usable neurons) x (usable seconds). The target spatial and 
neural states were defined using the three-dimensional position of the 
hand, and the five-dimensional latent variable representation of motor 
cortical activity obtained using GPFA, respectively. In both cases, the 
states were sampled at grab times, and the target state was defined to 
be the central location of the grab-triggered states, computed using 
convex hull peeling. Only 50% of the control trials were used to calculate 
the target states. The Euclidean distance from the target was then com- 
puted for each trial and time point, and the resulting distance curves 
were centred onthe end of the laser (Fig. 2c, Supplementary Videos 2, 3). 


Estimation of difference in external input contributions. We used the 
dynamical systems model, r’(¢) = A(r()) + u(O, to calculate the differ- 
ence inthe contribution of external inputs between conditions follow- 
ing the end of cortical inactivation in VGAT-ChR2-eYFP mice. Suppose 
we have two types of trial, A and B. In Fig. 2e, these types correspond 
to trials in which a post-laser reach occurred or did not occur, and in 
Fig. 3h, they correspond to trials in which thalamus was perturbed fol- 
lowing the end of cortical inactivation and trials in which the thalamus 
was not perturbed. Let r,(0) and r,(0) denote the population activity 
(principal component scores) on these trial types, and suppose the 
cortical inactivation ends at t=0. 

For t< 0, we have fixed r,(t) =r, (0) = O. Let € > O. According to the 
dynamical systems model, 


ty (0) ~ ty (0) = (A(ta(2)) + ug (0) - (ACR (0) + Ug(2)) 


A(t, (0)) = A(O) = A(R (O)) 
Thus, for small ¢, A(r,(€)) = A(rg(€)), SO 


ua(€) — Up(€) = ty (€) — th (€) - (Ata (€)) — A(ig(€))) = ty (€) - te (©) 


Thus, subtracting the derivative of population activity for the two 
conditions allows us to estimate the difference in the contributions 
of external inputs shortly after the end of cortical inactivation. This 
approximation relies on two assumptions. First, the firing rate deriva- 
tive is additive in the autonomous and input-dependent contributions. 
Second, is small; that is, h(r(O)) = A(O) must be close to A(r(s)). 


Direction of neural trajectories. This analysis addresses the question 
of whether neural activity for post-laser reaches first returned to the 
control baseline state. For each perturbation (VGAT, TIx3 and Sim1), 
we estimated the population activity on control reaches, r,(t), and 
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post-laser reaches, r,(0), using the first six principal component scores. 
These captured 98%, 99% and 97% of the variance in control reaches for 
VGAT, TIx3 and Sim1, respectively. At each point in time around the lift, 
we computed the derivative and divided by the norm of the derivative. 
This yielded the direction in population activity for control and laser 
reaches (see Extended Data Fig. 2a), shown for the first two principal 
components by the yellow and blue arrows in Extended Data Fig. 2b-d. 
We compared the similarity between the control and laser directions at 
each point in time by computing the inner product between them. This 
yielded the yellow curves in the right panels in Extended Data Fig. 2a—d. 
We then computed the difference between the population activity in 
the laser condition and the initial control state and normalized it to 
have unit length; this resulted in the direction from the state in the 
laser condition to the initial control state at each point in time, shown 
by the red arrows in the left panels of Extended Data Fig. 2b-d. We then 
found the similarity between this direction and the direction of the laser 
trajectory by computing their inner product, shown in the red curves 
in the right panels of Extended Data Fig. 2b—d. This analysis suggests 
that following the perturbations, the neural population recapitulated 
the pattern for reaching without returning to the control initial state. 


Neural decoding. A linear filter was designed to decode 3D hand veloci- 
ties from neural activity during reaches. The decoded hand velocities 
were then used as proxies for the components of neural activity relevant 
to movement in order to assess the effect of different types of pre-lift 
perturbations (VGAT, TIx3, Sim1) on the neural activity during reach 
(Extended Data Fig. 3). 

For decoding, both the 3D hand trajectories (S00 Hz, see ‘Behavioural 
task and video analysis’) and the multiunit neural activity (counts of all 
detected spikes on each recording channel with 2-ms bins, no single 
unit identification) were smoothed with the same Gaussian kernel 
(o=25 ms). Velocities were numerically derived from smoothed hand 
trajectories using a central difference filter of order 8. Firing rates were 
z-scored with respect to the activity at rest, computed combining 1.5-s 
windows preceding the start of each trial. Channels with mean absolute 
z-scores greater than 100 during movement (for example, for units 
with standard deviations very close to zero) were excluded from fur- 
ther analysis. The z-scores of neural activity were then processed with 
PCA for denoising and dimensionality reduction, with the principal 
components computed from the lift-aligned trial-averaged activity in 
control trials (window -100 ms to 300 ms around lift). The decoding 
results with this method were better than with any alternative choices 
we tried (Extended Data Fig. 3m, n), which included: (1) computing 
the PCA components from the data matrix obtained concatenating 
all lift-aligned control and post-laser neural trajectories (after end of 
perturbation) rather than from trial-averaged control activity, (2) using 
only identified single-unit activity instead of multiunit activity, (3) using 
GPFA instead of PCA for dimensionality reduction, and (4) smoothing 
the firing rates with different Gaussian kernels (ranging from o= 2ms 
to 0=500ms) before Z-scoring and dimensionality reduction. 

The decoder uses time-invariant coefficients to decode the hand 
velocity at any given timeas a linear combination of the 15 most recent 
samples of PCA-reduced neural activity (hence up to 28 ms inthe past). 
The decoder coefficients were obtained by regressing (ordinary least 
square sense, implemented via QR factorization in Matlab) velocity 
data against PCA-reduced neural data in a subset of trials (training 
set). Hand velocities were decoded during reaches in which the lift- 
to-grab sequence occurred within the first 500 ms after the cue (for 
control trials) or within the first 500 ms after the end of perturbation 
(for laser trials). Sessions without post-perturbation reaches satisfying 
these criteria (1 Vgat, 1 Sim1 session) or with only one such reach (1Sim1 
session) were excluded from the neural decoding analysis. The decod- 
ing window was extended by 50 ms before lift and 50 ms after grab, to 
include the beginning and the end of the reaching movement. Within 
each dataset, the training set used in fitting the decoder coefficients 


was comprised of a majority of control trials (three fifths). An additional 
subset of control trials (one fifth) was used to cross-validate the optimal 
choice of the number of PCA components to use in the decoder, as 
follows. For any choice of number of components to keep, a decoder 
was computed from the training control trials and used to decode the 
hand velocities in the validation control trials. The performance of 
each decoder was compared in terms of mean squared error between 
observed and decoded velocities. The minimum number of PCA dimen- 
sions that guaranteed performance within 1% of the overall minimum 
mean squared error across all choices was selected (using this 1% margin 
considerably reduced the dimensionality of the data in some sessions 
without compromising decoding performance). Finally, the selected 
decoder was used to decode hand velocities in the remaining testing 
subset of control trials (one fifth), which were not used for training or 
cross-validation, and on the laser trials in which reaches occurred at 
the end of the perturbation. 

We found that the decoder performance was not uniform across the 
three directions (forward, lateral, upward), but was consistently worse 
inthe lateral direction than in the other two directions. This may reflect 
the smaller extent to which the reaching trajectories sample lateral 
movements of the hand, or may reflect an insufficient sampling of the 
neural population of motor cortical cells responsible for the lateral 
movement of the hand. We thus showed the decoding performance in 
each direction separately, and primarily used the coefficient of deter- 
mination (R”) between decoded and observed hand velocities in each 
direction asthe summary performance metric, except for the compari- 
sons of decoding quality across trial types (Extended Data Fig. 3c, g, 
k) and choices of the decoder (Extended Data Fig. 3m, n) in which all 
directions were pooled together before computing the coefficient of 
determination. In most of the datasets, the velocity decoding accuracy 
was higher for control testing trials than laser trials. Nevertheless, in 
most sessions the decoder trained only on control trials still performed 
reasonably well on laser trials (at least in some of the directions). 


Analysis of simultaneous recordings from thalamus and cortex. 
For analysis of trial-averaged cortical and thalamic firing rates, and 
for the identification of lift-modulated thalamic units, analyses were 
performed as described above for the cortical recording experiments 
(Fig. 5b, Extended Data Fig. 10d). For single-trial analysis of population 
activity, lift-aligned thalamic and cortical spiking was smoothed witha 
o=100ms Gaussian kernel and z-scored using the baseline firing rates 
from lift -500 ms to lift -100 ms. Activity was first averaged across trials, 
and PCA was performed onthe trial-averaged data matrix to obtain the 
principal component coefficients. Next, activity from individual trials 
was multiplied by this coefficient matrix, resulting in estimates of the 
single-trial population trajectories for cortex and thalamus (Fig. 5c). 

According to the dynamical systems model, the derivative of the 
cortical state can be expressed as r’(t) = A(r(t)) + u(0). In the linear 
case, considering inputs from the thalamus, this corresponds to 
r’(t) = Ar(t) + Bs(t), where s(t) is the thalamic state. Because we meas- 
ured cortical and thalamic activity simultaneously, we attempted to fit 
models of this form by regression. The cortical population trajectories 
(either the single-trial or trial-averaged estimates) for the first three 
principal components were numerically differentiated. We took these 
estimates of the derivative of the cortical state as the dependent vari- 
ables for regression analyses using three different sets of independent 
variables: (1) thalamic population trajectories (first N principal compo- 
nents, with VN ranging from 1to 10), (2) cortical population trajectories 
(first N components), and (3) both thalamic and cortical population 
trajectories (first N components). These regression models were fit 
with ordinary least squares for both the trial-averaged (Fig. 5d, top 
row) and single-trial data (Fig. 5d, bottom row), with the goodness-of- 
fit of each model given by the coefficient of determination (R’ value). 

Allanalyses were performed with custom-written software in Matlab, 
except where otherwise noted. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Summary of effects of optogenetic perturbations of 
motor cortex. Each of the three columns shows data from one mouse type: left, 
VGAT-ChR2-eYFP (n=5 mice, n=7 sessions); middle, TIx3-Cre x Ai32 (n=3 mice, 
n=7sessions); right, Sim1-Cre x Ai32 (n=3 mice, n=5 sessions). a, Average z- 
scored firing rates of motor cortical neurons before, during and after 
optogenetic activation of inhibitory interneurons (left), intratelencephalic 
neurons (middle) and pyramidal tract neurons (right). The blue bars under thex 
axes represent laser-on epochs. Left, black bands at the bottom are putative 
inhibitory interneurons. b, Firing rates before and during laser stimulation for 
each mouse type. Firing rates outside the range 0.1-100 were plotted at these 
values owing to the log-log scale. c, Distribution of lift times on control 
(yellow), laser + cue (blue) and laser-only (magenta) trials for each mouse type. 


Histograms show data only for trials where alift occurred. d, Probability of a lift 
in each time bin (binomial maximum likelihood estimate) for control (yellow), 
laser-only (magenta) and laser + cue (blue) trials. Error bars show 95% binomial 
confidence intervals. e, Distribution of lift times for trials in which a lift 
occurred within 500 ms of either the cue (for control trials, yellow) or following 
the end of the laser (blue for laser + cue trials and magenta for laser only trials). 
f, Average hand trajectories on control (yellow) and post-laser (blue) reaches. 
g, Neural population activity from lift -100 ms to lift +425 ms on control 
(yellow) and post-laser (blue) reaches, obtained using trial-averaged PCA. Forf 
and g, one session was removed for VGAT (n=4 mice, n=6 sessions), and one 
was removed for Sim1 (n=2 mice, n=4 sessions), owing to the absence of post- 
laser reaches for alignment. 
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Extended Data Fig. 2| Comparison of the direction of neural trajectories for 
post-laser reaches with the direction of control trajectories, and with the 
direction to the initial cortical state on control trials. a, Explanation of the 
analysis method. We represent the population trajectory on control trials, r.(), 
and laser trials, r,(0), using the first six principal component scores, which 
account for 98%, 99% and 97% of the variance on control trials for VGAT, TIx3 
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(0 for post-laser reaches, we obtain the direction of the neural trajectory by 
computing the derivative and dividing by the norm of the derivative (blue). We 
perform the same calculation for the control trajectory r,(¢) (yellow), and also 
compute the direction from the neural state in the laser trajectory to the initial 
control state (red). We then compare the direction of the laser trajectory with 
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the control direction and the direction to the initial control state by taking the 
inner product with each. b, Left, neural population trajectories (first two 
principal components) for control (yellow) and post-laser (blue) reachesin 
VGAT-ChR2-eYFP mice (n=4 mice, n=6 sessions). The direction of the 
trajectories for control (yellow arrows) and laser (blue arrows) trajectories 
along the first two principal components are shown, along with the direction 
from the laser trajectory to the control initial state (red arrows). Right, 
similarity (inner product) between the direction of the laser trajectory and the 
direction of the control trajectory (yellow curve), and similarity between the 
direction of the laser trajectory and the control initial state (red curve).c, Asin 
b, but for T1x3-Cre x Ai32 mice (n=3 mice, n=7 sessions). d, As inb, but for Sim1- 
Cre x Ai32 mice (n=2 mice, n=4 sessions). 
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Extended Data Fig. 3 | Decoding of hand velocity from motor cortical 
activity on control and post-perturbation reaches. a, Left, scatterplots of 
decoded versus observed hand velocity in the forward, right, and upward 
directions on control reaches in an example session from a VGAT-ChR2-eYFP 
mouse. Only testing trials not used for training the decoder were used. Right, 
R? values for the regression of observed on decoded velocities for control 
reaches in each VGAT-ChR2-eYFP dataset (n=4 mice, n= 6 sessions). b, Left, 
scatter plots of decoded versus observed hand velocity for post-laser reaches 
in the dataset froma. Right, R? values for the regression of observed on 
decoded velocities for post-laser reaches in each VGAT-ChR2-eYFP dataset. 

c, Comparison of the decoder performance in control versus post-laser reaches 
for the dataset froma, bassessed using the R? computed after pooling across 
directions. d, Decoded position trajectories obtained by upsampling and 
numerically integrating (Simpson's rule) the decoded velocity for control trials 


(left) and laser trials (right) for the dataset ina, b.e-h, Decoder performance 
for TIx3-Cre x Ai32 mice (n=3 mice, n=7 sessions), organized as ina-d.i-I, 
Decoder performance for Sim1-Cre x Ai32 mice (n=2 mice, n=3 sessions), 
organized asina-d.m, Decoding performance for control testing trials on all 
sessions, by decoding method used (n=9 mice, n=16 sessions; all perturbation 
types aggregated). PCAavg refers to PCA coefficients extracted on lift-aligned 
trial averages with single trials projected onto these coefficients; PCAcat refers 
to PCA coefficients extracted on concatenated trial data; MU refers to 
multiunit activity; and SU refers to single units. For each method, the number 
of neural dimensions used for decoding was cross-validated (see Methods). Box 
plot shows the median and the 25th and 75th percentiles. n, Decoding 
performance for the PCAavg MU method, by the standard deviation of the 
Gaussian kernel used to extract firing rates (n= 9 mice, n=16 sessions). 
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Extended Data Fig. 4 | Variability of firing rates during optogenetic 
perturbations tothe cortical state. a, Standard deviation of firing rates across 
trials during laser stimulation in VGAT-ChR2-eYFP mice. The black curve is the 
standard deviation (over trials), averaged over all neurons (n=5 mice,n=7 
sessions, n=155 neurons). Error bars shows.e.m. Identified inhibitory neurons, 
which exhibited a firing-rate increase during the laser, were excluded. 
Smoothing was applied with a 50-ms Gaussian kernel for each trial. b, Standard 


Laser, 2s Laser, 2s 


deviation of firing rates across trials during laser stimulation in TIx3-Cre x Ai32 
mice, asina(n=3 mice, n=7 sessions, n=100 neurons). c, Standard deviation of 
firing rates across trials during laser stimulation in Sim1-Cre x Ai32 mice (n=3 
mice, n=5 sessions, n=115 neurons). Because it wasn’t possible to identify 
inhibitory neurons when excitatory neurons were stimulated, all cells were 
includedinbandc. 
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Extended Data Fig. 5| See next page for caption. 


Extended Data Fig. 5| Effect of different spike-train smoothing methods. 

a, Gaussian smoothing witha kernel width of o=25 ms for the reach-no-reach 
experiment, as shown in Fig. 2b. Note that the activity appears to change from 
the constant perturbed state slightly before the end of the laser. This is because 
the kernel smooths forward into the post-laser epoch. b, Gaussian smoothing 
witho=50 ms. The divergence from the perturbed state begins earlier owing to 
ahigher level of smoothing. c, Causal smoothing with a half-Gaussian kernel, 
truncated to use samples only from the past. Neural activity diverges from the 
perturbed state only after the end of the laser. d, Acausal smoothing with a half- 
Gaussian kernel, truncated to use samples only from the future. e, Gaussian 
smoothing in the sequential inactivation experiment witha kernel width of 
o=25ms,as shown in Fig. 3f. Note that the activity appears to change from the 


constant perturbed state slightly before the end of the cortical inactivation. 
There is also a delay fromthe start of the cortical inactivation to the arrival of 
the neural state at the constant value. f, Gaussian smoothing witho=50 ms. 

g, Causal smoothing with a half-Gaussian kernel, truncated to use samples only 
from the past. Neural activity diverges from the perturbed state only after the 
end of the laser. However, there is still alag from the start of cortical 
inactivation tothe arrival of neural activity at the constant perturbed state. 

h, Acausal smoothing witha half-Gaussian kernel, truncated to use samples 
only from the future. Neural activity again diverges from the perturbed state 
before the end of the cortical inactivation. At the start of the cortical 
inactivation, neural activity has already arrived at the perturbed state. 
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Extended Data Fig. 7 | Sequential inactivation of cortex and thalamus. 
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inactivation only (blue) and sequential inactivation of cortex and thalamus 
(green) (n=3 mice, n=4 sessions). The cortical inactivation ends at 2,000 ms 
from the start of the trial, and the thalamic inactivation ends at 4,000 ms. Bars 
show maximum likelihood estimates of the binomial probability, with 95% 
confidence intervals. Corresponding data in Fig. 3d-f. b, Lift-locked neural 
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Extended Data Fig. 8 | Effects of stimulation ofthalamocorticalterminalson dataset shownin Fig. 4b. Right, hand position and neural activity inthe same 


cortical firing rates and behaviour. a, Firing rates and spike rasters for two session under optogenetic stimulation of thalamocortical terminals at 4 Hz, 
example neurons at each stimulation frequency. b, Firing rates in the 2s before 10 Hzand 40 Hz. d, Probability that a lift is initiated within the first S00 ms of 
stimulation versus the 2s during stimulation at each stimulation frequency. the cue oncontroltrials and at each stimulation frequency. Each curve shows a 
Each point is a single neuron (n= 288 cells). c, Left, single-trial hand position single session (n=6 sessions, n=3 mice). Corresponding data in Fig. 4. 
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Extended Data Fig. 10 | Simultaneous recording in thalamus and motor 
cortex. a, Raw data from the thalamic Neuropixels probe aligned to motor 
cortex stimulation (cyan). A band of channels (red dotted line) exhibited 
activity locked to motor cortical stimulation, indicating projections to motor 
cortex. b, Histological section showing targeting of probe to motor thalamus. 
The bright region in the thalamus indicates ChR2 expression (eYFP) in an Ai32 
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mouse with an injection of AAV-2/9-Syn-Cre. The Neuropixels probe inthe 
thalamus was coated witha green dye (DiO). The red dotted line corresponds 
approximately to the red dotted lineina.c, Spike rasters from cortical and 
thalamic neurons onasingle reaching trial. d, Peri-lift firing rates for thalamic 
neurons (n=3 mice, n=3 sessions). 
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Sample size No statistical methods were used to predetermine sample size. At least n = 3 mice were used for each type of experiment, and experiments 
were performed over several days for each animal, as is typical in electrophysiological studies of motor control in vertebrates. Due to the lack 
of post-laser reaches in one Sim1-Cre X Ai32 mouse, this animal was not included for the subset of analyses focusing on post-laser reaches 
(leaving n = 2 mice for these analyses, but n = 3 mice for the other analyses). 


Data exclusions Cortical recordings were aborted when the quality of the raw signals was very poor (n = 2 VGAT-ChR2-EYFP mice and n = 1 Sim1-Cre X Ai32 
mouse). For the simultaneous recordings in thalamus and cortex, sessions were excluded when the raw signals were poor in one or both 
regions or when either histology or optogenetic tagging attempts indicated that the first probe was not in motor thalamus (n = 15 sessions). 


Replication The main results from each experiment were identified in a subset of sessions and replicated in subsequent sessions. In cases where some 
differences were observed between mice or sessions (e.g. the limb oscillation under 10 Hz stimulation in Fig. E9), this is stated in the text and 
shown in the figures. 


Randomization Not applicable: all experimental manipulations were performed at the level of trials within each session, and there was no allocation of 
animals into different treatment groups. 


Blinding Because the experimental manipulations occurred at the level of trials (laser stimulation vs no stimulation), it was not possible to blind the 
experimenter to the trial type. There was no allocation of animals into different treatment groups. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
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Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
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Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Mice of either sex were used, at age 8-25 weeks. The following mouse lines were used: 
(1) VGAT-ChR2-EYFP (Slc32a1-COP4*H134R/EYFP) 
(2) Vglut2-IRES-Cre 
(3) Ai32 (Rosa-CAG-LSL-ChR2(H134R)-EYFP-WPRE) 
(4) Ai32 (Rosa-CAG-LSL-ChR2(H134R)-EYFP-WPRE) crossed to one of the following driver lines: 
(4a) Tlx3-Cre PL56Gsat 
(4b) Sim1-Cre KJ18Gsat 
(4c) Rbp4-Cre KL100Gsat 


Wild animals N/A 
Field-collected samples N/A 
Ethics oversight All procedures were approved by the Institutional Animal Care and Use Committee at Janelia Research Campus (protocols 13-99, 


16-139, and 19-177). 


=) 
fev) 
a 
iS 
= 
a) 
= 
a) 
Wn 
a) 
je) 
= 
a) 
=F 
= 
@) 
I) 
©) 
a 
= 
© 
Wn 
iS 
3 
fev) 
= 
=< 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


dé 


O 


Bul 


> 


yi 
= 
= 
= 
D 
SZ 


Article 


The emergence of transcriptional identity in 
somatosensory neurons 


https://doi.org/10.1038/s41586-019-1900-1 


Received: 17 April 2019 


Accepted: 6 November 2019 


Published online: 8 January 2020 


Nikhil Sharma’”, Kali Flaherty'”, Karina Lezgiyeva'?, Daniel E. Wagner’, Allon M. Klein? & 
David D. Ginty'”* 


More than twelve morphologically and physiologically distinct subtypes of primary 
somatosensory neuron report salient features of our internal and external 
environments'~. It is unclear how specialized gene expression programs emerge 
during development to endow these subtypes with their unique properties. To assess 
the developmental progression of transcriptional maturation of each subtype of 
principal somatosensory neuron, we generated a transcriptomic atlas of cells 
traversing the primary somatosensory neuron lineage in mice. Here we show that 
somatosensory neurogenesis gives rise to neurons ina transcriptionally 
unspecialized state, characterized by co-expression of transcription factors that 
become restricted to select subtypes as development proceeds. Single-cell 
transcriptomic analyses of sensory neurons from mutant mice lacking transcription 
factors suggest that these broad-to-restricted transcription factors coordinate 
subtype-specific gene expression programs in subtypes in which their expression is 
maintained. We also show that neuronal targets are involved in this process; 
disruption of the prototypic target-derived neurotrophic factor NGF leads to aberrant 
subtype-restricted patterns of transcription factor expression. Our findings supporta 
model in which cues that emanate from intermediate and final target fields promote 
neuronal diversification in part by transitioning cells froma transcriptionally 
unspecialized state to transcriptionally distinct subtypes by modulating the selection 
of subtype-restricted transcription factors. 


Decades of analyses have identified more than twelve functionally 
distinct subtypes of dorsal root ganglia (DRG) somatosensory neu- 
ron that collectively enable the detection of a broad range of salient 
features of the internal and external world’ *. A fundamental question 
in sensory and developmental biology is how somatosensory neuron 
subtypes acquire their characteristic physiological, morphological, 
and synaptic properties during development, enabling animals to 
detect and respond to innocuous and noxious thermal, chemical, 
and mechanical stimuli. Classical studies of embryonic development 
have indicated that migrating multipotent neural crest progenitors, 
originating from the dorsal neural tube, populate nascent DRG®. During 
ganglia formation, dedicated progenitors that express either neuro- 
genin-1 (NEUROGI) or neurogenin-2 (NEUROG2) have been proposed 
to give rise to distinct somatosensory neuron subtypes®, which then 
innervate peripheral target fields where they form morphologically 
distinct types of axonal endings’. Current models of somatosensory 
neuron development have primarily been inferred from studies that 
have analysed changes in expression of individual genes or types of 
axonal ending in loss-of-function models’”®. Here, we use genome-wide 
transcriptomic analyses coupled with molecular genetic approaches 
to define the transcriptional mechanisms by which somatosensory 
neuron subtypes diversify. 


scRNA-seq of somatosensory neurons 


To begin to define the transcriptional cascades that underlie the speci- 
fication of somatosensory neuron subtypes, we performed single-cell 
RNA sequencing (scRNA-seq) at embryonic day 11.5 (E11.5), which is 
shortly after DRG formation, and at critical developmental milestones 
during somatosensory neuron development: at E12.5, when virtually 
all DRG neurons are postmitotic’ and have extended axons well into 
the periphery; at E15.5, when peripheral and central target fields of 
somatosensory neurons are being innervated’°”; at postnatal day O 
(PO), when maturation of sensory neuron endings within the skin and 
other targets is occurring’; at P5, when most peripheral endings have 
refined into their mature morphological states and central projection 
terminals are properly organized within select spinal cord laminae**; 
and in early adulthood (P28-42) (Fig. 1a, Extended Data Fig. la-f). We 
first examined primary sensory neurons in young adult DRG from all 
axial levels (Fig. 1a, Extended Data Fig. 1a). We used principal compo- 
nent analysis (PCA) and t-distributed stochastic neighbour embedding 
(t-SNE) to cluster adult DRG neurons on the basis of transcriptome 
similarity (Fig. la). Each cluster was classified as a subtype on the basis 
of previous studies that have described markers and functions for 
individual somatosensory neuron subtypes, confirmation by in situ 
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Fig. 1|scRNA-seq of developing and mature DRG sensory neurons. a, t-SNE 
visualizations of DRG scRNA-seq data. SST, somatostatin. b, UMAP 
visualization of DRG scRNA-seq data from E11.5 with developmental trajectory 
and gene expression information overlaid. TPT, tags per ten thousand. 

c, Quantification (mean +s.e.m.) of tdTomato* neurons and representative 
image (right). d, Heat map and quantification of genes enriched in each 


analysis, and comparison to scRNA-seq data from adult trigeminal 
ganglia (see Methods, Extended Data Figs. 2a, b, 3a-d, Supplementary 
Data 1). These cell-type classifications are consistent with previously 
published RNA-seq findings of adult DRG and trigeminal ganglia’. 
We next sought to determine how the transcriptional identities of 
mature somatosensory neuron subtypes compare to those of newborn 
sensory neurons by analysing the transcriptomes of cells from DRG at 
F11.5 (Fig. Ib). We visualized the E11.5 scRNA-seq data using uniform 
manifold approximation and projection (UMAP)”°. Mapping historically 
defined marker genes onto the UMAP representation revealed three 
principal cell types in E11.5 ganglia: 1) multipotent neural crest progeni- 
tors (NCPs), marked by Sox10”!”*; 2) sensory neuron progenitors (SNPs), 
marked by Neurog1 and Neurog2”; and 3) nascent, postmitotic sensory 
neurons marked by expression of the somatosensory neuron gene advil- 
lin (Avil™*) and concomitant loss of expression of cell-cycle-associated 
genes (Fig. 1b, Extended Data Fig. 4a). Monocle 3” was then used to infer 


somatosensory neuron subtype as well as their expression levels in 
unspecialized sensory neurons (USNs). Boxes represent interquartile range 
(IQR), whiskers represent minimum and maximum values, and notches 
represent the 95% confidence interval of the median. *P< 0.0001, two-sided 
Wilcoxon rank-sum test with Bonferroni correction. For n values, see Methods. 


developmental relationships between NCPs, SNPs, and nascent Avil’ 
sensory neurons. This analysis revealed a single continuous trajectory 
connecting NCPs, SNPs, and Avil* sensory neurons, suggesting a lack 
of transcriptional diversity in sensory neuron progenitors populating 
the Avil’ compartment. Unexpectedly, labelled descendants of Neu- 
rogl* SNPs from Neurog1“; Rosa26"4" mice had a broad range 
of cell diameters (Fig. 1c), consistent with the previous finding that 
in vitro directed differentiation with Neurogi overexpression leads to 
the generation of bothsmall- and large-diameter DRG sensory neuron 
subtypes”®. Moreover, genes that were highly enriched in E11.5 Avil* 
sensory neurons, relative to progenitors, generally remained expressed 
in adult somatosensory neuron subtypes (Extended Data Fig. 4b). On 
the other hand, the majority of genes whose expression was restricted 
to individual terminal somatosensory subtypes of adult ganglia were 
expressed at trace levels in E11.5 Avil’ sensory neurons (Fig. 1d), sug- 
gesting that upon cell-cycle exit sensory neurons are transcriptionally 
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Fig. 2| Transcriptional development of subtypes of DRG neurons. a, Force- 
directed layout of DRG sensory neurons overlaid with time point or cell-type 
information. b, Force-directed layout of DRG development overlaid with 


unspecialized, specifically with respect to subtype-specific genes. 
These observations led us to consider whether a ‘transcriptionally 
unspecialized state’ serves as the starting point for somatosensory 
subtype diversification. 

To investigate this idea, we compared scRNA-seq transcriptomes 
generated from sensory neurons between E11.5 and adulthood. Pro- 
spective identities of sensory neurons at each developmental stage 
were assigned on the basis of transcriptional similarity using canoni- 
cal correlation analysis” (Fig. 1a, Extended Data Fig. 1f) and a graph- 
based strategy for locally embedding consecutive time points based 
on the transcriptional variation they share. We constructed single- 
cell k-nearest neighbour graphs for each time point (¢,) with nodes 
representing cells and edges linking neighbours. These graphs were 
then joined by identifying neighbouring cells in adjacent time points 
using a coordinate system learned from the subsequent time point 
(t;,.,3 see Methods). The resulting graph forms a branching network 
that can be visualized using a force-directed layout. This representa- 
tionspans all developmental stages and provides a consolidated view 
of the transcriptional maturation of each principal somatosensory 
neuron subtype from E11.5 to adulthood (Fig. 2a). 

We next tested whether this graph-based representation of devel- 
opmental gene expression profiles of sensory neuron subtypes reca- 
pitulates known developmental relationships. We inspected the gene 
expression patterns of the transcription factors (TFs) Runx1 and Runx3, 
which have been implicated inthe development of select unmyelinated 
(C-fibre) neuron subtypes and proprioceptors, respectively”? *°. Runx1 
expression was prominent in unmyelinated sensory neuron subtypes, 
whereas Runx3 expression was restricted to mature proprioceptors of 
adult ganglia, as previously described”*” (Fig. 2b). Furthermore, the 
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a 
Individual cells 

expression of indicated genes. c, Heat map of subtype-restricted TFs ineach 

somatosensory neuron subtype of adult ganglia. For n values, see Methods. 


graph-based representation accurately depicts the developmental 
switch from NerkI' to Ret’ that occurs in subsets of non-peptidergic 
C-fibre neurons” (Fig. 2b). To facilitate exploration of this dataset 
by the community, we created an HTML-based interactive interface 
to enable visualization of the expression pattern of any gene at each 
developmental time point, from E11.5 to adulthood, for each of the 
somatosensory neuron subtypes (interactive somatosensory neuron 
browser: https://kleintools.hms.harvard.edu/tools/springViewer_1_6_ 
dev.html?datasets/Sharma2019/all). 


TFsin sensory neuron development 


One observation from our initial analysis of the graph-based repre- 
sentation of developmental transcriptomes of sensory neurons is that 
the TFs Runx1 and Runx3, which have been implicated in the develop- 
ment of sensory neuron subtypes, are broadly co-expressed in nascent 
E11.5 Avil* sensory neurons, in contrast to their mutually exclusive 
expression patterns in terminally differentiated subtypes of adult DRG 
(Fig. 2b). This is consistent with the finding that RUNX1 and RUNX3 
proteins are colocalized in embryonic DRG”. This observation led us 
to consider whether other TFs that are subtype-restricted in adult gan- 
glia may be co-expressed in nascent, transcriptionally unspecialized 
sensory neurons. To address this possibility, we identified TFs other 
than Runx1 and Runx3 that are expressed in select subtypes of soma- 
tosensory neurons from mature ganglia by inspecting 1,152 neuronally 
expressed TFs, and found that 23 are expressed in distinct subsets 
of adult somatosensory neurons (Fig. 2c). Strikingly, the sCRNA-seq 
data revealed that several TFs expressed in select subtypes of sensory 
neurons of mature DRG are co-expressed in newborn E11.5 sensory 
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Fig. 3 | The unspecialized sensory neuron compartment gives rise to most or 
all somatosensory neuron subtypes. a, Developmental trajectory of sensory 
neurons (left) and t-SNE visualization with TF expression overlaid (right). 

b, Schematic description of the Avil““"”?Rosa26“4™” labelling strategy. 


neurons (Extended Data Fig. 5a), like Runx1 and Runx3. These scRNA-seq 
findings were verified using double single-molecule RNA fluorescent 
in situ hybridization (SmRNA-FISH), with Runx1 and Runx3 as well as 
Pou4f2 and Pou4f3 serving as test cases. smRNA-FISH measurements 
showed that Runx1 and Runx3 as well as Pou4f2 (AB rapidly adapting 
(RA)-low-threshold mechanoreceptors (LTMRs), AS-LTMRs, C-LTMRs) 
and Pou4f3 (calcitonin gene-related peptide (CGRP)-as and CGRP-ns) 
are co-expressed in the majority of E11.5 Avil’ sensory neurons, despite 
their mutually exclusive expression patterns in neurons from PO and 
adult ganglia (Extended Data Fig. 5b, c). These observations suggest 
that all somatosensory neuron subtypes transit through a postmi- 
totic transcriptionally unspecialized state. To further address this, we 
genetically labelled descendants in the Avil’ cellcompartment at E11.5 
by administering a low dose of tamoxifen (0.5 mg) at E11.5 to Avil(®”?; 
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c, t-SNE visualization of Avil“*”?Rosa26'"!47™0 scRNA-seq with cell-type 
identity or tdTomato expression overlaid. d, Schematic representing strategy 
for labelling neurons in Pou4f2°"” mice. e, smRNA-FISH and quantification for 
the indicated transcripts (mean+s.e.m.). Forn values, see Methods. 


Rosa26'5"*47omato mice and found tdTomato transcripts to be present in 
5-19% of cells in each somatosensory neuron subtype of adult ganglia 
by scRNA-seq (Fig. 3a-c). In addition, we labelled all descendants of 
one of the ‘broad early’-to-‘subtype restricted late’ TFs, Pou4f2, with 
tdTomato using a Pou4f2“; Rosa26“4™0 mouse line, and trans- 
duced terminally differentiated Pou4f2* DRG subtypes in the same 
mouse using an adeno-associated virus (AAV) carrying a Cre-dependent 
GFP reporter delivered at P14 (Fig. 3d). smRNA-FISH analysis revealed 
tdTomato transcripts in more than 90% of DRG sensory neurons in 
adult Pou4f2“; Rosa26'!t47mato. 4 AV-CAG-FLEX-GFP"*"” mice; by con- 
trast, GFP transcripts were restricted to AB RA-LTMRs, A5-LTMRs and 
C-LTMRs (mature Pou4f2° populations) (Fig. 3e). Notably, a develop- 
mental analysis of subtype-specific gene expression revealed that, in 
general, large-diameter neurons achieved transcriptional maturity 
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Fig. 4 | Pou4f2 and Pou4f3 regulate select somatosensory neuron subtype 
maturation. a, t-SNE visualizations of sCRNA-seq data for neurons generated 
from Pou4f3"™ and Pou4f3* littermates. b, t-SNE visualizations of scRNA- 
seq data for neurons generated from Pou4f2" and Pou4f2 littermates. 
c, Fold-change distribution of cell-type-specific genes when comparing 
Pou4f3* and Pou4f3"™™" littermate control samples. d, Fold-change 


before small-diameter neurons, consistent with the historical view”? 


(Extended Data Fig. 6a, b). Together, these results indicate that cells in 
the transcriptionally unspecialized compartment express a broad array 
of TFs that become restricted to select subsets of sensory neurons as 
development proceeds. 


Specification of subtype identity 


We next investigated whether broad-to-restricted TFs contribute to 
sensory neuron diversification during the transcriptionally unspe- 
cialized state, thus broadly influencing the transcriptional matura- 
tion of sensory neurons, or whether these TFs primarily influence the 
subtypes in which their expression is maintained. We collected DRG 
from neonatal (PO-5) pups containing null alleles of either Pou4f2 or 
Pou4f3, which are representative broad-to-restricted TFs, and gen- 
erated scRNA-seq transcriptomes from Pou4f2C?#*0C) mice and 
Pou4f2‘” littermate controls as well Pou4f3" mice and Pou4f3™* lit- 
termate controls. Initial inspection of the scRNA-seq data from both 
Pou4f2 and Pou4f3 mutant mice revealed clusters corresponding to 
each somatosensory subtype (Fig. 4a, b). We found that cell numbers 
were not compromised, as representative ganglia (T7 and T8) from 
Pou4f2 or Pou4f3 knockouts had similar numbers of neurons to ganglia 
from littermate controls (Extended Data Fig. 7a, b). Notably, genes that 
were preferentially expressed in either Pou4f2‘ or Pou4f3* neurons 
showed reduced expression in the respective knockouts, compared 
to littermate controls, whereas the expression of randomly selected 
genes was unchanged (Fig. 4c, d). By contrast, somatosensory neu- 
ron subtypes in which expression of Pou4f2 and Pou4f3 was normally 
extinguished after E11.5 generally exhibited less marked alterations to 
subtype-specific gene expression or subtype-restricted TF expression 
in knockout mice when compared to controls (Fig. 4c, d, Extended Data 
Fig. 7a, b). Given the reduction in subtype-specific gene expression in 
Pou4f2 and Pou4f3 mutants, we also determined the consequences of 
Pou4f2 or Pou4f3 ablation on the unique axonal endings associated with 
mature somatosensory neuron subtypes. Although the axonal endings 
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distribution of cell-type-specific genes when comparing Pou4f2””° and 
Pou4f2""™™" littermates. c, d, Boxes represent IQR, whiskers represent 
minimum and maximum values, and notches represent the 95% confidence 
interval of the median. *P< 0.01, two-sided Wilcoxon rank-sum test with 
Bonferroni correction. For n values, see Methods. 


associated with Pou4f2' subtypes are known to form longitudinal lan- 
ceolate endings around hair follicles’, the axonal ending morphologies 
associated with the Pou4f3* subtypes were not known. Genetic label- 
ling experiments using newly generated Cre lines for each Pou4f3* 
subtype revealed that the axons of CGRP-a neurons have free nerve 
endings that penetrate the epidermis, whereas those of CGRP-n neurons 
form circumferential endings associated with hair follicles (Extended 
Data Fig. 8a—d). Longitudinal lanceolate endings and CGRP’ circum- 
ferential axonal endings were partially compromised in Pou4f2 and 
Pou4f3 knockout mice, respectively (Extended Data Fig. 8e-k). Further- 
more, postnatal depletion of Pou4f3 using short hairpin RNA (shRNA) 
altered subtype-specific gene expression and function. Together, these 
results show that two representative subtype-restricted TFs, Pou4f2 
and Pou4f3, control transcriptional maturation of the sensory neuron 
subtypes in which they remain expressed. 


Extrinsic control of subtype identity 


We next investigated whether differential maintenance or extinction of 
TFs in emerging subtypes of neurons occurs viaa process that is entirely 
intrinsic to developing sensory neurons or is guided by extrinsic cues. 
The mesenchymal and epidermal environments through which embry- 
onic somatosensory axons extend are rich sources of extrinsic signals, 
including neuronal growth factors®. Therefore, we tested whether nerve 
growth factor (NGF), an extrinsic cue that is crucial for the growth 
and survival of embryonic somatosensory neurons expressing the 
NGF receptor NTRK1 (TRKA)*4—which represent about 80% of adult 
DRG—may control the TF selection process. We performed scRNA-seq 
on DRG neurons from neonatal mice containing a targeted mutation 
in the Ngf gene. This genome-wide analysis of Ngf-dependent gene 
expression was carried out on the apoptosis-deficient Bax-knockout 
genetic background to circumvent the apoptotic cell death of DRG 
neurons associated with developmental loss of NGF*®. Clustering 
analysis of the scRNA-seq data revealed that all somatosensory neu- 
ron subtypes were present in Bax” controls (Fig. 5a), but there were 
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Fig. 5| The extrinsic cue NGF is required for subtype-specific gene 
expression patterns. a, t-SNE visualizations of scRNA-seq data for neurons 
generated from PO Bax” and Ngf’ Bax” littermates. b, Cell-type-specific gene 
expression in proprioceptor and A-fibre mechanoreceptor sensory neuron 
subtypes in Bax’ and Ngf’ Bax” littermates. c, Cell-type-specific gene 


fewer transcriptionally distinct neuronal populations in Ngf’ Bax” 
double mutants (Fig. 5a). Ntrk1-negative neurons (proprioceptors 
and A-fibre mechanoreceptors) were not markedly transcriptionally 
compromised in Ngf’ Bax’ mutants compared to Bax“ controls 
(Fig. 5b), as expected. However, the subtype-specific gene expression 
patterns that are normally present in NtrkI* sensory neuron subtypes 
were markedly altered in neurons from Ngf’ Bax mutants (Fig. 5c). 
Notably, examination of the aforementioned subtype-restricted TFs 
showed that the combinations of TFs expressed in the unidentified 
neuronal clusters in Ngf’ Bax mutants bore no resemblance to the 
TF combinations observed in neuronal subtypes in control mice, and 
this was confirmed using smRNA-FISH (Extended Data Fig. 9a, b). Fur- 
thermore, members of the NEUROD family of TFs, which are normally 
extinguished during embryonic development, remained expressed at 
PO in Ngf’ Bax’ mutants (Extended Data Fig. 9b, c). These findings 
indicate that the selection of somatosensory neuron subtype-restricted 
TFs is controlled, at least in part, by extrinsic cues acting on nascent 
sensory neurons. 


Discussion 


Our genome-wide transcriptomic analyses of cells traversing the devel- 
opmental stages of somatosensory neurons support a model in which 
newborn somatosensory neurons are unspecialized with respect to 
expression of subtype-restricted TFs, and differential maintenance 
of unique combinations of these subtype-restricted TFs enables nas- 
cent sensory neurons to resolve into mature subtypes (Extended Data 
Fig. 9d). Early co-expression, and subsequent resolution, of TFs has 
been proposed to underlie the diversification of stem cells in the hae- 
matopoietic lineage*’ **, neural crest progenitors before lineage com- 
mittment®’, and developing spinal motor neurons*® *—although this 


\V| -S=- 


LVS/PIOW 


expression inall other sensory neuron subtypesin Bax’ and Ngf’ Bax 
littermates. Boxes represent IQR, whiskers represent minimum and maximum 
values, and notches represent the 95% confidence interval of the median. 

b,c, Allclusters are different from controls by two-sided Wilcoxon rank-sum 
test with Bonferroni correction; P< 0.01. For n values, see Methods. 


view has been challenged in the case of the haematopoietic system”. 
The diversification of somatosensory neurons shares commonalities 
and differences with these systems. We propose that, unlike cells of the 
early neural crest and the haematopoietic lineages, somatosensory 
neuron subtypes emerge following cell-cycle exit and, unlike other 
progenitor types, newborn, postmitotic Avil* somatosensory neu- 
rons are not migratory but rather are permanent residents of sensory 
ganglia. Therefore, nascent sensory neurons cannot rely on cell divi- 
sion or migration to encounter new environments. Rather, a feature 
of nascent somatosensory neurons is that they immediately extend 
axons along intermediate targets, such as large blood vessels, en route 
to target organs, suchas the skin, where they encounter extrinsic cues, 
including NGF and other secreted factors. We propose a model in which 
multiple distinct extrinsic cues act on axons of transcriptionally unspe- 
cialized sensory neurons, depending onthe timing and trajectories of 
their projection patterns. These cues function, in part, to resolve TF 
expression patterns froma co-expressed state to asubtype-restricted 
state to promote the transcriptional specializations that underlie the 
unique molecular, morphological, and physiological properties of 
somatosensory neuron subtypes. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Animals 

All mouse experiments in this study were approved by the National 
Institutes of Health and the Harvard Medical School IACUC. Experi- 
ments followed the ethical guidelines outlined in the NIH Guide for the 
care and use of laboratory animals (https://grants.nih.gov/grants/olaw/ 
guide-for-the-care-and-use-of-laboratory-animals.pdf). Avpria™* 
and Bmpr1b"™*“ mice were generated using standard homologous 
recombination techniques in embryonic stem (ES) cells. Chimaeras 
were generated by blastocyst injection and subsequent germline trans- 
mission was confirmed by tail PCR. The neo selection cassette was 
excised using a Flp-deleter strain for the Avprla’"“ line but left intact 
for the Bmpr1b*“ line. Mice were housed under standard conditions 
and given chow and water ad libitum. Plug date was considered E0.5 
and date of birth was considered PO. Pou4f3 null mice were obtained 
fromJax (Stock No. 008645). Pou4f2 null(Cre) mice were obtained from 
Jax (Stock No. 030357). ROSA26 Cre-dependent tdTomato reporter 
mice were obtained from Jax (Stock No. 007914). Avil“®” mice were 
obtained from Jax (Stock No. 032027). All experiments with wild- 
type animals were conducted with mice on the C57BI/6) background 
obtained from Jackson Laboratory. 


Dissociation and purification of isolated single sensory neurons 
The dissection strategy used was nearly identical for allages presented 
inthis study. Specifically, animals were killed, and spinal columns were 
removed and placed on atray of ice. Individual DRG with central and 
peripheral nerves attached were removed from all axial levels and 
placed into ice-cold DMEM:F12 (1:1) supplemented with 1% pen/strep 
and 12.5mM D-glucose. A fine dissection was performed to remove the 
peripheral and central nerve roots, resulting in only the sensory ganglia 
remaining. We collected 200-400 individual ganglia for the DRG and 
20-30 ganglia for the trigeminal for each bioreplicate of single-cell 
sequencing. All scRNA-seq experiments in this study were performed 
with >2 bioreplicates. Sensory ganglia were dissociated in 40 units 
papain, 4 mg/ml collagenase, 10 mg/ml BSA, 1 mg/ml hyalurdonidase, 
0.6 mg/ml DNase in DMEM:F12 + 1% pen/strep + 12.5 mM glucose for 
10 min at 37 °C. Digestion was quenched using 20 mg/ml ovomucoid 
(trypsin inhibitor),20 mg/ml BSA in DMEM:F12 + 1% pen/strep + 12.5 mM 
glucose. Ganglia were gently triturated with fire-polished glass pipettes 
(opening diameter of approx. 150-200 pm). Neurons were then passed 
through a 70-um filter to remove cell doublets and debris. Neurons were 
pelleted and washed 4-8x in20 mg/ml ovomucoid (trypsin inhibitor), 
20 mg/ml BSA in DMEM:F12 + 1% pen/strep + 12.5 mM glucose followed 
by 2x washes with DMEM:F12 + 1% pen/strep + 12.5 mM glucose all at 
4 °C. After washing, cells were resuspended in 50-200 pl of DMEM:F12+ 
1% pen/strep + 12.5 mM glucose. Cells were counterstained with trypan 
blue, visually inspected and counted with a haemocytometer. Disso- 
ciated ganglia preparations were considered to pass quality control 
and used for scRNA-seq if >90% of cells were viable, as measured by 
exclusion of trypan blue, and virtually no cellular debris was visible. 


Tissue processing for RNA-FISH 

For sample preparation, individual DRG from mice were rapidly 
dissected and the axial level was identified by identifying specific 
DRG using the T12 DRG as a landmark. The T12 DRG was defined as the 
ganglion immediately caudal to the last rib. DRG were frozen in dry- 
ice-cooled 2-metylbutane and stored at —80 °C until they were sec- 
tioned. DRG were sectioned at a thickness of 15-20 um and RNAs 
were detected by RNAscope (Advanced Cell Diagnostics) using the 
manufacturer’s protocol. Total numbers of neurons per section of 


DRG were estimated by counting neuronal nuclei as measured by DAPI 
and counts were confirmed as reasonable estimates by comparison to 
counts obtained by measuring Avil or Pou4f1, which are both pan-soma- 
tosensory neuron markers. The numbers of somatosensory neurons 
per section were similar for DAPI compared with Avil or Pou4f1 meas- 
urements. The following probes were used: Mm-Th™ (Cat#: 317621), 
Mm-Calb1 (Cat#-428431), Mm-Pou4f2 (custom made), Mm-Pou4f3 
(custom made), Mm-Auvil (Cat#: 498531), Mm-Asicl (Cat#: 480581), 
Mm-Mrgpra3 (Cat#: 548161), Mm-Pou4fl (Cat#:414671),Mm-Colg (Cat#: 
496211), Mm-Sst (Cat#: 404631), Mm-Pualb (Cat#: 421931), Mm-/kzf1 
(Cat#: 511201), Mm-Aupria (Cat#: 418061), Mm-Oprk1 (Cat#: 316111), 
Mm-Mrgprd (Cat#: 417921), Mm-Bmpr1b (custom made), Mm-Vcan 
(Cat#: 486231), Mm-7rpm8 (Cat#: 420451), Mm-Neurod1 (Cat#: 416871), 
Mm-Neurod6 (Cat#: 444851), Mm-Shox2 (Cat#: 554291), Mm-Hopx 
(Cat#: 405161), Mm-Runx1 (Cat#: 406671), Mm-Runx3 (Cat#: 451271), 
GFP (Cat#: 400281), tdTomato (Cat#: 317041). 


Single-cell RNA library preparation, sequencing, and analysis 
Single-cell RNA-seq was performed with the 10x Genomics Chromium 
Single Cell Kit (v2 and v3). Approximately 1,000-8,000 cells were added 
tothe reverse transcription mix before loading onthe microfluidic chip. 
Downstream reverse transcription, cDNA synthesis/amplification, and 
library preparation were performed according to the manufacturer's 
instructions. Allsamples were sequenced ona NextSeq 500 with58 bp 
sequenced into the 3’ end of the mRNAs. Initial gene expression tables 
for individual barcodes were generated using the cellranger pipeline 
according to instructions provided by 10x Genomics. All gene expres- 
sion tables were then imported into R and analysed with Seurat (v.2.3) 
with standard procedures. 


Cluster identification. Clusters were classified into transcription- 
ally distinct somatosensory neuron subtypes: AB RA-LTMRs***, AB 
field-LTMRs/AB SA1-LTMRs***”, AS-LTMRs*°*8, C-LTMRs*°*?, CGRP* 
neurons (containing six transcriptionally discrete subtypes), 
MRGPRD* polymodal nociceptors**” ™, proprioceptors>”*, SST* pruri- 
ceptors (somatostatin/NPPB’)*”*, cold-sensitive thermoceptors”*, 
and two main classes of support cell (endothelial and Schwann cells). 
Atranscriptionally distinct cluster uniquely corresponding to Mer- 
kel cell-associated AB SA1-LTMRs was not detected. However, based 
on bulk RNA-seq analysis of genetically defined and fluorescence- 
activated cell sorting (FACS)-purified LTMR subtypes, AB SA1-LTMRs 
contain transcriptomes with a striking resemblance to those of AB 
field-LTMRs“; therefore, these two AB LTMR subtypes are likely to be 
embedded within the same cluster in our t-SNE plot. We confirmed that 
marker genes for each of the sensory neuron subtypes were expressed 
in subsets of DRG neurons and noted that the relative proportions 
of certain sensory neuron subtypes varied across ganglia located at 
different axial levels (Extended Data Fig. 2a, b). Moreover, the soma- 
tosensory neuron subtypes identified in this adult DRG analysis were 
remarkably similar to those identified in sCRNA-seq analysis of 5,556 
somatosensory neurons obtained from adult trigeminal ganglia (TG) 
(Extended Data Fig. 3a—d). The cell types identified by our sCRNA-seq 
findings are largely consistent with previously published adult DRG/ 
TGscRNA-seq datasets'*791, 


Exclusion criteria. As a first quality control filter, individual cells were 
removed from the dataset if they had fewer than 1,000 discovered 
genes, fewer than 1,000 unique molecule identifiers (UMIs) or more 
than 5% of reads mapping to mitochondrial genes (several datasets use 
a10% threshold for this parameter and this is indicated in the respective 
Figures). Preparing single-cell suspensions of DRG/TG sensory neurons 
often results in a population of non-neuronal-neuronal doublets. To 
circumvent this, we defined individual cells showing expression of 
Schwann cell markers (Sox2 or Ednrb) and neuronal markers as neurons 
that did not resolve into single cells during the dissociation process. 
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Cells matching these criteria were removed before performing subse- 
quent analysis and this analysis was applied to all datasets presented in 
this study. Lastly, for simplicity, most displays exclude non-neuronal 
cells (Schwann and endothelial). Generally, we found that <10% of cells 
in any given dataset were classified as non-neuronal. 


General analysis parameters. Raw UMI counts were normalized to 
10,000 UMIs per cell. Highly variable genes were calculated using the 
FindVariableGenes function with mean.function = ExpMean, disper- 
sion.function = LogVMR, x.low.cutoff = 0, and y.cutoff = 0.5. PCA and 
t-SNE analyses were used for dimensionality reduction and elbow plots 
were generated to determine which principal components to include 
in the analysis. This corresponded to roughly the first 20 principal 
components. Canonical correlation analysis (CCA) and matching of cell 
types through development were performed as previously described”. 


Identification of differentially expressed genes. Differential gene 
expression analysis was performed on all expressed genes using the 
FindMarker function in Seurat using the Wilcoxon rank-sum test and 
a pseudocount of 0.001 was added to each gene to prevent infinite 
values. Pvalues <10 *” were defined as O, as the R environment does not 
handle numbers <10°”. Each identified cell type was compared against 
an outgroup which corresponded to all other cells in the dataset at the 
respective time point. All genes identified were spot checked by overlay- 
ing the expression levels on thet-SNE plot to ensure the computational 
method was faithfully identifying genes with the prescribed features. 
For subtype-specific gene expression analysis, subtype-specific genes 
were first defined using the littermate control mice because knockout 
mice were not always available on a pure C57/BI6 background. The 
subtype-specific genes identified in littermate control mice were nearly 
identical to those observed in C57/Bl6 control animals. Of the top 100 
subtype-specific genes, 50 were randomly selected from this group and 
compared to the knockout controls. Fifty expression-matched genes 
that were not included in the subtype-specific gene list were selected 
as randomized control genes. 


Monocle 3 analysis (for E11 trajectory analysis). The Monocle 3 work- 
flow was performed ina similar fashion as previously described”. In 
brief, the Monocle 3 pipeline offers several key advantages. First, this 
pipeline allows the generation of trajectories over potentially discon- 
tinuous underlying data. This is first accomplished by performing di- 
mensionality reduction with the recently proposed UMAP algorithm”’, 
instead of t-SNE. Notably, UMAP provides comparable visualization 
quality to t-SNE and UMAP also performs better at preserving global 
relationships, which is a noted shortcoming of the t-SNE algorithm. 
Furthermore, the UMAP algorithm is more efficient [O(N)] compared 
to t-SNE [Mog(N)], making UMAP a more computationally friendly 
option for large datasets, as used in this study. The UMAP parameters 
used in this study are comparable to those previously applied” (reduc- 
tion.use = ‘PCA’, max.dim = 2L, neighbours = 50, min_dist = 0.1, cosine 
distance metric). Similar parameters have been used to finely resolve 
subtrajectories® and therefore we argue that these parameters provide 
the greatest sensitivity for identifying branches, if they exist, within 
our dataset. 


STITCH analysis. Although UMAP provides an advance in gene expres- 
sion-based trajectory inference, more complex changes in gene expres- 
sion space, as are often observed during development®, continue to 
provide a marked challenge to identifying underlying trajectories. A 
recently proposed algorithm, STITCH”, provides an alternative strat- 
egy, whichis described here in brief. Instead of projecting all the data 
into a single low-dimensional space, STITCH assembles a manifold that 
is defined by a series of independent PCA subspaces corresponding 
to each individual time point with nodes representing cells and edges 
linking transcriptionally similar cells ina low-dimensional space. This 


allows connections between cells to be identified even if cells are opti- 
mally described by differing underlying PC subspaces. From here, each 
cellintime point ¢, where ie (E11.5, E12.5, E15.5, PO, P5, adult) forms an 
outgoing edge from ¢,> t,and ¢,> ¢,_,, Vie (time points) where all cells 
are projected into the PC subspace defined by ¢,alone. In essence, edges 
connect each cell to its closest transcriptional neighbour within atime 
point and the preceding time point. Edges are then subjected to local 
neighbourhood restriction such that an outgoing edge from acell was 
maintained if its neighbours were at most threefold as far as the cell’s 
closest neighbour. To avoid spurious connections that may form, edges 
were next subjected to a global neighbourhood restriction where edges 
are maintained if they were below the average edge distance across all 
cells between time points (¢,, ¢;_,) or within 1 standard deviation of the 
average edge distance within the time point. The graph was further 
reduced by retaining at most 20 mutual nearest neighbour edges. 


Cloning, production, purification, concentration and quality 
control of AAV 

AAV backbones were generated using standing cloning and molecular 
biology techniques. The following sequences were used for shRNAs: 
luciferase (GCGCGATAGCGCTAATAATTT) and Pou4f3 (TATCCCTTG- 
GAGAAAAGCCTTGTT). AAVs included GFP, tagged with haemagglu- 
tinin (TACCCATACGATGTTCCAGATTACGCT), as a reporter to monitor 
infectivity. Each individual preparation of AAV (2/9) and (2/PHP.S™) 
was produced by transient transfection of pRC9, pHelper, and AAV- 
genome plasmid into 6-12 T225 flasks of HEK 293T cells. Viral medium 
was collected and replaced at 72 h. 293T cells and asecond round of viral 
medium were collected at 120 h post transfection. AAVs were extracted 
from cell pellets using Salt Active Nuclease (Articzymes) in40 mM Tris, 
500 mM NaCl and 2 mM MgCl, pH 8 (SAN buffer). AAVs in superna- 
tant were precipitated with 8% PEG/SOO mM NaCl and resuspendedin 
SAN buffer. Viral suspensions were loaded onto an iodixanol gradient 
(OptiPrep) and subsequently concentrated using Amicon filters with 
a100-kD cutoff to a volume of 25-30 pl (1 x PBS + 0.001% F-68) per 6 
7225 flasks transfected. Viral titres were normalized to 1 x 10“ pg/ml 
and stored at -80 °Cin 5-10-pl aliquots. AAVs (2/9) were injected intra- 
peritoneally (IP) into pups at PO. Pups were transiently anaesthetized 
by hypothermia and bevelled pipettes were used to deliver 10” viral 
genomes ina volume of 10 pl (0.01% Fast Green, 1 x PBS). After mice 
were injected, they were returned to ambient temperature and upon 
regaining full mobility were cross fostered with nursing CD1 females. 
Approximately seven days after transduction, DRG were extracted 
for subsequent experimental analysis. Upon dissection, all DRG were 
visualized and monitored for GFP expression. For behavioural experi- 
ments, aminimum of 10” viral genomes of AAV (2/PHP.S) were delivered 
to P21 mice via intravenous injection (retroorbital vein). 


Immunostaining analysis 

DRG. For immunostaining analysis, mice (P28-42) were anaesthetized 
with isoflurane and transcardially perfused with 10 ml of 1 x PBS (with 
heparin) followed by 10 ml of 1 x PBS/4% paraformaldehyde at room 
temperature. Spinal columns were then removed and rinsed in1x PBS 
and then cryoprotected overnight in 1 x PBS/30% sucrose at 4 °C, then 
embedded in NEGSO and stored at -70 °C. For cryosectioning, tissue 
blocks were equilibrated to -20 °C for 1hand then sectioned onto glass 
slides at athickness of 20-25 ppm. Slides were stored at -70 °C until they 
were ready for staining. Slides with sections were taken from freezers 
and immediately placed into 1 x PBS and washed 3x with 1x PBS for 5 min 
each at room temperature. Tissue was blocked using 1 x PBS/5% normal 
donkey serum/0.05% Triton X-100 for 1h at room temperature. Tissue 
was then washed with 1 x PBS 3 x 5 min each at room temperature. Tissue 
was then incubated in primary antibody (rabbit anti-NEUN, Millipore: 
MAB377, 1:1,000; goat anti-mCherry/tdTomato, CederLane: ABO040- 
200, 1:1,000) in 1 x PBS/5% normal donkey serum/0.05% Triton X-100 
overnight at 4 °C. Tissue was washed in 1 x PBS 3x for 5 min at room 


temperature followed by secondary antibody (donkey anti-rabbit 488, 
1:1,000; donkey anti-goat 555, 1:1,000) diluted in 1 x PBS/5% normal 
donkey serum/0.05% Triton X-100 for 1h at room temperature. Lastly, 
tissue was washed in 1 x PBS 3x for 5 min at room temperature followed 
by application of mounting medium and glass coverslip. 


Skin sections. Skin sections were immunostained as described for 
DRG sections with the following differences: Section thickness was 
55-60 um. Primary antibodies used were chicken anti-GFP, Aves: GFP- 
1020, 1:1,000; goat anti-mCherry/tdTomato, CederLane: ABO040-200, 
1:1,000; and rabbit anti-CGRP, Immunostar: 24112, 1:1,000). Allimages 
were obtained as z-stacks using a Zeiss LSM 700 confocal microscope 
using a10x or 20x objective. 


Two-plate temperature choice assay 

Animals were habituated to the behavioural apparatus for 30 min 
before experimental analysis. Animals were placed into the centre of 
two identical chambers with one chamber randomly set to 30 °C and 
the other to the test temperature indicated. Animals were recorded as 
they freely explored the arena while automatic tracking software was 
used to track animals over a5-min period. Time spent in each tempera- 
ture chamber was quantified as a fraction of total time tested and one 
temperature was tested per day. 


RNA isolation, reverse transcription, and (RT-PCR 

DRG were dissected as described above; however, instead of subjecting 
ganglia to dissociation, they were directly lysed by gentle agitation in 
Trizol at room temperature for 10 min. The RNeasy Mini (Qiagen) kit was 
used according to the manufacturer’s instructions to purify DNA-free 
RNA. RNA was converted to cDNA using 200-250 ng of RNA with the 
high-capacity cDNA reverse transcription kit (Thermo Fisher Scientific). 
qRT-PCR was performed with technical triplicates and mapped back 
to relative RNA concentrations using a standard curve built froma 
serial dilution of cDNA. Data were collected using the LightCycler 480 
SYBR Green I Master mix (Roche) on a QuantStudio 3 qPCR machine 
(Applied Biosystems). 


Statistics and reproducibility 

For all sCRNA-seq data shown all individual cells for the labelled cell 
type are shown with no downsampling or subsetting implemented 
unless explicitly indicated. Differential and comparative gene expres- 
sion analysis was conducted using a two-sided Wilcoxon rank-sum test 
with Bonferroni corrected Pvalues. Immunostaining and cell counting 
comparisons were done using atwo-sided t-test. Behavioural analysis 
was compared using a two-way ANOVA followed by a Tukey’s post-hoc 
test. AllscRNA-seq samples were derived from n = 2 biologically inde- 
pendent samples with the exception of the adult (P28-42) sample which 
was derived from n= 6 biologically independent samples. The sample 
sizes (cell numbers) for each cell type and samples sizes for other 
analyses were as follows: Fig. 1a Adult: 257 AB-field-LTMR cells, 273 
AB-RA-LTMR cells, 182 AS-LTMR cells, 1,554 C-LTMR cells, 1,440 CGRP-a 
cells, 850 CGRP-e cells, 270 CGRP-n cells, 705 CGRP-y cells, 758 CGRP-8 
cells, 333 CGRP-C cells, 2,817 MRGPRD cells, 234 proprioceptors, 761 
SST cells, 488 cold thermoceptor cells; P5: 209 AB-field-LTMR cells, 
297 AB-RA-LTMR cells, 237 AS-LTMR cells, 1,392 C-LTMR cells, 445 
CGRP-a cells, 473 CGRP-e cells, 153 CGRP-n cells, 334 CGRP-y cells, 640 
CGRP-6 cells, 243 CGRP-¢ cells, 3,019 MRGPRD cells, 104 propriocep- 
tors, 787 SST cells, 405 cold thermoceptor cells; PO: 214 AB-field-LTMR 
cells, 163 AB-RA-LTMR cells, 165 AS-LTMR cells, 739 C-LTMR cells, 284 
CGRP-« cells, 188 CGRP-€ cells, 122 CGRP-n cells, 216 CGRP-y cells, 359 
CGRP-6 cells, 122 CGRP-¢ cells, 1,704 MRGPRD cells, 103 propriocep- 
tors, 397 SST cells, 284 cold thermoceptor cells; E15.5: 61 AB-field-LTMR 
cells, 33 AB-RA-LTMR cells, 96 AS-LTMR cells, 383 C-LTMR cells, 144 
CGRP-«a cells, 45 CGRP-e cells, 26 CGRP-n cells, 97 CGRP-y cells, 208 
CGRP-6 cells, 63 CGRP-C cells, 670 MRGPRD cells, 40 proprioceptors, 


61SST cells, 128 cold thermoceptor cells, 3,196 unlabelled cells; E12.5: 
30 AB-field-LTMR cells, 20 AB-RA-LTMR cells, 30 AS-LTMR cells, 122 
C-LTMR cells, 57 CGRP-a cells, 87 CGRP-e cells, 48 CGRP-n cells, 60 
CGRP-y cells, 9 CGRP-6 cells, 37 CGRP-¢ cells, 555 MRGPRD cells, 37 
proprioceptors, 24 SST cells, 105 cold thermoceptor cells, 7,909 unla- 
belled cells; Fig. 1b E11.5: 1,951 unspecialized sensory neurons, 5,402 
sensory neuron progenitors, 2,781 neural crest progenitors; Fig. lcn=3 
biologically independent samples; Fig. 1d 1,951 unspecialized sensory 
neurons, 257 AB-field-LTMR cells, 273 AB-RA-LTMR cells, 182 AS5-LTMR 
cells, 1,554 C-LTMR cells, 1,440 CGRP-a cells, 850 CGRP-e cells, 270 
CGRP-n cells, 705 CGRP-y cells, 758 CGRP-6 cells, 333 CGRP-C cells, 2,817 
MRGPRD cells, 234 proprioceptors, 761 SST cells, 488 cold thermocep- 
tor cells; Fig. 2a, b 696 AB-field LTMR cells, 734 AB-RA-LTMR cells, 659 
A8&-LTMR cells, 3,750 C-LTMR cells, 2,072 CGRP-a cells, 1,503 CGRP-e 
cells, 555 CGRP-n cells, 1,377 CGRP-y cells, 1,895 CGRP-6 cells, 743 
CGRP- cells, 7,498 MRGPRD cells, 462 proprioceptors, 1,733 SST cells, 
1,246 cold thermoceptor cells, 1,951 unspecialized sensory neurons, 
14,982 cells with unmatched identity; Fig. 2c 1,951 unspecialized sensory 
neurons, 257 AB-field-LTMR cells, 273 AB-RA-LTMR cells, 182 AS5-LTMR 
cells, 1,554 C-LTMR cells, 1,440 CGRP-a cells, 850 CGRP-e cells, 270 
CGRP-n cells, 705 CGRP-y cells, 758 CGRP-6 cells, 333 CGRP-C cells, 2,817 
MRGPRD cells, 234 proprioceptors, 761 SST cells, 488 cold thermocep- 
tor cells; Fig. 3a10,321 cells from E11.5; Fig. 3c 159 AB-field-LTMR cells, 
385 AB-RA-LTMR cells, 203 AS-LTMR cells, 1,827 C-LTMR cells, 441 
CGRP-a cells, 334 CGRP-e cells, 348 CGRP-n cells, 417 CGRP-y cells, 
1,665 CGRP-6 cells, 196 CGRP-C cells, 3,666 MRGPRD cells, 185 pro- 
prioceptors, 802 SST cells, 717 cold thermoceptor cells; Fig. 3e n=3 
biologically independent samples; Fig. 4a, c control/knockout 776/435 
AB-field-LTMR cells, 728/1,114 AB-RA-LTMR cells, 667/927 A5-LTMR 
cells, 2,928/2,486 C-LTMR cells, 478/656 CGRP-« cells, 990/582 CGRP-€ 
cells, 721/589 CGRP-n cells, 711/540 CGRP-y cells, 1,845/2,381 CGRP-8 
cells, 417/230 CGRP-C cells, 5,556/7,508 MRGPRD cells, 446/654 pro- 
prioceptors, 1,747/1,460 SST cells, 493/675 cold thermoceptor cells; 
Fig. 4b, d control/knockout 191/254 AB-field-LTMR cells, 246/332 AB-RA- 
LTMR cells, 170/236 A&-LTMR cells, 917/800 C-LTMR cells, 706/545 
CGRP-« cells, 495/365 CGRP-e cells, 279/330 CGRP-n cells, 559/429 
CGRP-y cells, 907/605 CGRP-9 cells, 292/341 CGRP-C cells, 1,977/2,960 
MRGPRD cells, 123/213 proprioceptors, 724/835 SST cells, 427/392 cold 
thermoceptor cells; Fig. 5a-cin159 controls, 342 AB-field-LTMR cells, 
122 AB-RA-LTMR cells, 413 AS-LTMR cells, 783 C-LTMR cells, 363 CGRP-a 
cells, 314 CGRP-€ cells, 320 CGRP-n cells, 418 CGRP-y cells, 460 CGRP-98 
cells, 352 CGRP-¢ cells, 1,162 MRGPRD cells, 368 proprioceptors, 149 
SST cells, 442 cold thermoceptor cells; in Ngf’ ;Bax”, 82 AB-field-LTMR 
cells, 162 AB-RA-LTMR cells, 124 AS5-LTMR cells, 395 proprioceptors, 
2,558 ClusterA cells, 1,878 ClusterB cells, 362 ClusterC cells, 1,461 Clus- 
terD cells, 714 ClusterE cells; Extended Data Fig. 1a 257 AB-field-LTMR 
cells, 273 AB-RA-LTMR cells, 182 AS-LTMR cells, 1,554 C-LTMR cells, 
1,440 CGRP-«a cells, 850 CGRP-e cells, 270 CGRP-n cells, 705 CGRP-y 
cells, 758 CGRP-6 cells, 333 CGRP-C cells, 2,817 MRGPRD cells, 234 pro- 
prioceptors, 761SST cells, 488 cold thermoceptor cells; Extended Data 
Fig. 1b 209 AB-field-LTMR cells, 297 AB-RA-LTMR cells, 237 AS5-LTMR 
cells, 1,392 C-LTMR cells, 445 CGRP-a cells, 473 CGRP-€ cells, 153 CGRP-n 
cells, 334 CGRP-y cells, 640 CGRP-6 cells, 243 CGRP-C cells, 3,019 MRG- 
PRD cells, 104 proprioceptors, 787 SST cells, 405 cold thermoceptor 
cells; Extended Data Fig. 1c 214 AB-field-LTMR cells, 163 AB-RA-LTMR 
cells, 165 A5-LTMR cells, 739 C-LTMR cells, 284 CGRP-« cells, 188 CGRP-e 
cells, 122 CGRP-n cells, 216 CGRP-y cells, 359 CGRP-6 cells, 122 CGRP-C 
cells, 1,704 MRGPRD cells, 103 proprioceptors, 397 SST cells, 284 cold 
thermoceptor cells; Extended Data Fig. 1d 61 AB-field-LTMR cells, 33 
AB-RA-LTMR cells, 96 AS-LTMR cells, 383 C-LTMR cells, 144 CGRP-a 
cells, 45 CGRP-e cells, 26 CGRP-n cells, 97 CGRP-y cells, 208 CGRP-8 
cells, 63 CGRP-C cells, 670 MRGPRD cells, 40 proprioceptors, 61 SST 
cells, 128 cold thermoceptor cells; Extended Data Fig. le 30 AB-field- 
LTMR cells, 20 AB-RA-LTMR cells, 30 AS-LTMR cells, 122 C-LTMR cells, 
57 CGRP-a cells, 87 CGRP-e cells, 48 CGRP-n cells, 60 CGRP-y cells, 9 
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CGRP-6 cells, 37 CGRP-C cells, 555 MRGPRD cells, 37 proprioceptors, 
24 SST cells, 105 cold thermoceptor cells; Extended Data Fig. 1f Mature/ 
PS merge, P5/PO merge, E15.5/PO merge, E15.5/E12.5 merge; Extended 
Data Fig. 2a, bn=15 biologically independent sections for eachin situ 
at each axial level; Extended Data Fig. 3a—c 293 AB-RA-LTMR cells, 106 
AS-LTMR cells, 408 C-LTMR cells, 225 CGRP-a cells, 595 CGRP-¢ cells, 
127 CGRP-n cells, 329 CGRP-y cells, 199 CGRP-6 cells, 96 CGRP-C cells, 
1,103 MRGPRD cells, 131 SST cells, 656 cold thermoceptor cells; 
Extended Data Fig. 4a—e 257 AB-field-LTMR cells, 273 AB-RA-LTMR cells, 
182 AS-LTMR cells, 1,554 C-LTMR cells, 1,440 CGRP-a cells, 850 CGRP-€ 
cells, 270 CGRP-n cells, 705 CGRP-y cells, 758 CGRP-8 cells, 333 CGRP-¢ 
cells, 2,817 MRGPRD cells, 234 proprioceptors, 761SST cells, 488 cold 
thermoceptor cells, 1,951 unspecialized sensory neurons, 5,402 sensory 
neuron progenitors, 2,781 neural crest progenitors; Extended Data 
Fig. 5a 696 AB-field-LTMR cells, 734 AB-RA-LTMR cells, 659 AS-LTMR 
cells, 3,750 C-LTMR cells, 2,072 CGRP-a cells, 1,503 CGRP- cells, 555 
CGRP-n cells, 1,377 CGRP-y cells, 1,895 CGRP-6 cells, 743 CGRP-¢ cells, 
7,498 MRGPRD cells, 462 proprioceptors, 1,733 SST cells, 1,246 cold 
thermoceptor cells, 1,951 unspecialized sensory neurons, 14,982 cells 
with unmatched identity; Extended Data Fig. 5b 257 AB-field-LTMR 
cells, 273 AB-RA-LTMR cells, 182 AS-LTMR cells, 1,554 C-LTMR cells, 
1,440 CGRP-a cells, 850 CGRP-e cells, 270 CGRP-n cells, 705 CGRP-y 
cells, 758 CGRP-6 cells, 333 CGRP-C cells, 2,817 MRGPRD cells, 234 pro- 
prioceptors, 761SST cells, 488 cold thermoceptor cells; Extended Data 
Fig. 5cn=3 biologically independent samples; Extended Data Fig. 6a, 
b adult: 257 AB-field-LTMR cells, 273 AB-RA-LTMR cells, 182 AS-LTMR 
cells, 1,554 C-LTMR cells, 1,440 CGRP-a cells, 850 CGRP-¢ cells, 270 
CGRP-n cells, 705 CGRP-y cells, 758 CGRP-6 cells, 333 CGRP-C cells, 2,817 
MRGPRD cells, 234 proprioceptors, 761 SST cells, 488 cold thermocep- 
tor cells; PS 209 AB-field-LTMR cells, 297 AB-RA-LTMR cells, 237 
A6-LTMR cells, 1,392 C-LTMR cells, 445 CGRP-a cells, 473 CGRP-€ cells, 
153 CGRP-n cells, 334 CGRP-y cells, 640 CGRP-6 cells, 243 CGRP-C cells, 
3,019 MRGPRD cells, 104 proprioceptors, 787 SST cells, 405 cold ther- 
moceptor cells; PO 214 AB-field-LTMR cells, 163 AB-RA-LTMR cells, 165 
AS-LTMR cells, 739 C-LTMR cells, 284 CGRP-« cells, 188 CGRP-¢ cells, 
122 CGRP-n cells, 216 CGRP-y cells, 359 CGRP-6 cells, 122 CGRP-C cells, 
1,704 MRGPRD cells, 103 proprioceptors, 397 SST cells, 284 cold ther- 
moceptor cells; E15.5 61 AB-field-LTMR cells, 33 AB-RA-LTMR cells, 96 
A6-LTMR cells, 383 C-LTMR cells, 144 CGRP-a cells, 45 CGRP-€ cells, 26 
CGRP-n cells, 97 CGRP-y cells, 208 CGRP-8 cells, 63 CGRP-¢ cells, 670 
MRGPRD cells, 40 proprioceptors, 61SST cells, 128 cold thermoceptor 
cells; E12.5 30 AB-field-LTMR cells, 20 AB-RA-LTMR cells, 30 AS-LTMR 
cells, 122 C-LTMR cells, 57 CGRP-a cells, 87 CGRP-e cells, 48 CGRP-n 
cells, 60 CGRP-y cells, 9 CGRP-@ cells, 37 CGRP-C cells, 555 MRGPRD 
cells, 37 proprioceptors, 24 SST cells, 105 cold thermoceptor cells; 
Extended Data Fig. 7a n = 3 biologically independent samples for the 
Avil in situ, control/knockout 776/435 AB-field-LTMR cells, 728/1,114 
AB-RA-LTMR cells, 667/927 A&-LTMR cells, 2,928/2,486 C-LTMR cells, 
478/656 CGRP-a cells, 990/582 CGRP-e cells, 721/589 CGRP-n cells, 
711/540 CGRP-y cells, 1,845/2,381 CGRP-6 cells, 417/230 CGRP-C cells, 
5,556/7,508 MRGPRD cells, 446/654 proprioceptors, 1,747/1,460 SST 
cells, 493/675 cold thermoceptor cells; Extended Data Fig. 7b n=3 
biologically independent samples for the in situ, control/knockout 
191/254 AB-field-LTMR cells, 246/332 AB-RA-LTMR cells, 170/236 
A8&-LTMR cells, 917/800 C-LTMR cells, 706/545 CGRP-a cells, 495/365 
CGRP-<e cells, 279/330 CGRP-n cells, 559/429 CGRP-y cells, 907/605 
CGRP-6@ cells, 292/341 CGRP-¢ cells, 1,977/2,960 MRGPRD cells, 123/213 
proprioceptors, 724/835 SST cells, 427/392 cold thermoceptor cells; 
Extended Data Fig. 8e, ft-SNE plots represent 257 AB-field-LTMR cells, 
273 AB-RA-LTMR cells, 182 AS-LTMR cells, 1,554 C-LTMR cells, 1,440 
CGRP-a cells, 850 CGRP-e cells, 270 CGRP-n cells, 705 CGRP-y cells, 758 
CGRP-6 cells, 333 CGRP-¢ cells, 2,817 MRGPRD cells, 234 propriocep- 
tors, 761SST cells, 488 cold thermoceptor cells; Extended Data Fig. 8h 
n=3 biologically independent samples for the immunostaining; 
Extended Data Fig. 8j n = 3 biologically independent samples for the 


immunostaining; Extended Data Fig. 8kn=3 biologically independent 
samples for the immunostaining and the in situ; Extended Data Fig. 9a 
control 342 AB-field-LTMR cells, 122 AB-RA-LTMR cells, 413 AS-LTMR 
cells, 783 C-LTMR cells, 363 CGRP-a cells, 314 CGRP-e cells, 320 CGRP-n 
cells, 418 CGRP-y cells, 460 CGRP-0 cells, 352 CGRP-C cells, 1,162 MRG- 
PRD cells, 368 proprioceptors, 149 SST cells, 442 cold thermoceptor 
cells; Ngf’;Bax 82 AB-field-LTMR cells, 162 AB-RA-LTMR cells, 124 
A8-LTMR cells, 395 proprioceptors, 2,558 ClusterA cells, 1,878 ClusterB 
cells, 362 ClusterC cells, 1,461 ClusterD cells, 714 ClusterE cells; 
Extended Data Fig. 9c n=3 biologically independent samples for each 
in situ; Supplementary Data 1 257 AB-field-LTMR cells, 273 AB-RA- 
LTMR cells, 182 AS-LTMR cells, 1,554 C-LTMR cells, 1,440 CGRP-a« 
cells, 850 CGRP-e cells, 270 CGRP-n cells, 705 CGRP-y cells, 758 
CGRP-8 cells, 333 CGRP-¢ cells, 2,817 MRGPRD cells, 234 propriocep- 
tors, 761 SST cells, 488 cold thermoceptor cells. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Quality control metrics for DRG sensory neuron UMIs (considered to be low quality) were excluded from subsequent analysis. 
scRNA-seq data and canonical correlation analysis. a—e, Distribution of the f, Integration of adult/P5 (first plot), P5/PO (second plot), PO/E1S.5 (third 
number of genes discovered in each cell (individual points) ineach population plot), and E15.5/E12.5 (fourth plot) using canonical correlation analysis to find 
of sensory neuron (underlying violin plot) in adult mice (a), P5 (b), PO (c), E15.5 common sources of variation between time points. Single cells are labelled as 
(d), and E12.5 (e). Individual cells with fewer than 1,000 genes (considered to be individual points, with colour representing identified cell types and grey 

low quality) or more than 10,000 genes (considered likely to be doublets) were representing cells in the preceding time point. For n values, see Methods. 
eliminated from subsequent analysis. Individual cells with fewer than 1,000 
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Extended Data Fig. 2 |Somatosensory neuron subtype composition varies subtypes present at different percentages at different axial levels. b, Example 
across axial levels. a, Left, schematic representing which axial levels were images of smRNA-FISH for transcriptionally distinct somatosensory neuron 
quantified. Right, quantification of smRNA-FISH to determine the percentage subtypes in C6/7 (top), T7/8 (middle) and L4/5 (bottom) DRG. For n values, 

of C6/7, T7/8, and L4/5 DRG neurons that corresponds toeachtranscriptionally | seeMethods. 

defined somatosensory neuron subtype. Black dotted lines highlight the 
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Extended Data Fig. 3 | Dorsal root ganglia and trigeminal ganglia comprise 
similar subtypes of somatosensory neurons. a, t-SNE visualization of 
trigeminal ganglia scRNA-seq data obtained from adult (P28-42) mice. Colours 
denote principal cell types and dotted circles were added to aid in visualization 
of principal cell types. b, Distribution of the number of genes discovered in 
each population of sensory neuron in adult trigeminal ganglia displayed as 
violin plots. c, Heat map depicting expression of genes that are enrichedin 
somatosensory neuron subtypes resident in DRG as well as their expression 
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levels in cognate subtype counterparts in trigeminal ganglia. d, Heat map 
depicting expression of genes that are enriched in somatosensory neuron 
subtypes resident in the trigeminal ganglia as well as their expression levels in 
cognate subtype counterparts in DRG. c, d, Boxes represent IQR, whiskers 
represent minimum and maximum values, and notches represent the 95% 
confidence interval of the median. *P< 0.01, two-sided Wilcoxon rank-sum test 
with Bonferroni correction. For n values, see Methods. 
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Extended Data Fig. 4| Neural crest progenitors, sensory neuron progenitors 
and unspecialized sensory neurons express highly distinct gene programs. 
a, Heat map depicting cell cycle (S/G,/M)-associated genes for the principal 
subtypes identified at E11.5.b, Heat map depicting expression of genes 
enriched in USNs in both mature somatosensory neuron subtypes and USNs. 

c, Left, heat map depicting expression of genes enriched in USNs as wellas their 
expression in NCPs and SNPs. Right, violin and box plots depicting example 
genes enriched in USNs. d, Left, heat map depicting expression of genes 
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enriched in NCPs as well as their expression in SNPs and USNs. Right, violin and 
box plots depicting example genes enriched in NCPs.e, Left, heat map 
depicting expression of genes enriched in SNPs as well as their expressionin 
NCPs and USNs. Right, violin and box plots depicting example genes enriched 
in SNPs. a-e, Boxes represent IQR, whiskers represent minimum and maximum 
values, and notches represent the 95% confidence interval of the median. 
*P<0.01, two-sided Wilcoxon rank-sum test with Bonferroni correction. Forn 
values, see Methods. 
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Extended Data Fig. 5 | Force-directed layout of putative subtype-restricted 
transcription factors. a, Force-directed layout representation of DRG with 
expression patterns displayed for the remaining putative subtype-restricted 
transcription factors. b, t-SNE visualization of expression of Runx1, Runx3, 
Pou4f2 and Pou4f3 in the adult DRG. c, Left, smRNA-FISH for Runx1 and Runx3in 
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interval of the median. *P< 0.01, two-sided Wilcoxon rank-sum test with see Methods. 
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Extended Data Fig. 7|DRG counts and TF analysis in Pou4f2 and Pou4f3 each somatosensory neuron subtype in Pou4f3""™" (left) or Pou4f3*" (right) 
mutants. a, Representative images of Avil smRNA-FISH from T7/8 DRG littermates. d, Box plots displaying the expression of subtype-restricted TFs in 
in Pou4f3""" (left) or Pou4f3" (right) littermate DRG. Right of images, each somatosensory neuron subtype in Pou4f2""” (left) or Poudf2kore/Kocre) 
quantification of estimated cell count per DRG. b, Representative images of (right) littermates. c,d, Boxes represent IQR, whiskers represent minimum and 
AvilsmRNA-FISH from T7/8 DRG in Pou4f2*° (left) or Poud fon r/o?) maximum values, and notches represent the 95% confidence interval of the 
(right) littermate DRG. Right of images, quantification of estimated cell count median. For n values, see Methods. 


per DRG. c, Box plots displaying the expression of subtype-restricted TFs in 


A : Cc 


Bmpr1b-T2a-Cre targeting strategy Avpr1a-T2a-Cre targeting strategy 


Wild-type Exons Wild-type Exons 


S'UTR——+—+-+ 4 3'UTR 5'UTR me 3'UTR 


targeting vector Os we targeting vector 


5 arn’ arm T2a-Cre-TGA| Frt-PGK:Neo®-pA-Frt} 3’ homobeye arm 5° Rormsogy arm T2a- -Cre-TGA  Frt-PGK: Neo®-pA-F rt! 3 fentslonval arm 
Avpr1 at2a- -Cre/WT 

Bmpr1b'=-cremT Last Exon ; 

Last Exon 5’ -\T2a-Cre-TGA-Frt-PGK:Neo®pA-Frt- 3’ UTR 

5) -\T2a-Cre-TGA-Frt-PGK:Neo*-pA-Frt-l 3’ UTR Last Exon (after Neo cassette removal) 
5’-| T2a-Cre-TGA 23’°UTR 

B D 

Bmpr1 bT2aCreWT- AAV/-CAG:FLEX-GFP Avpr1 al2e-CreMT-Roga26lsl-tdtomatoT 


E Avpriat=cre; F Bmprib™<r: G Poudf2ce: | 
—___Avprta _____Rosa26'si Tense Bmprtb AAV-CAG:FLEX-GFPP#41Y AAV-CAG:FLEX-GFPP##LY a ee 
m3 CGRP 
oS _ Skin 


CGRP-y 


DRG 
DRG 


CGRP-o 


Pou4f2-subtypes 


AB RA-LT! RA-LTMR 
AS-LTMR LTMR 
C-LTMR_ LTMR 
H Avpria™*ce Bmprib™ce Poud4f2e® = 
Epicermis imma z <5 
Free endings > 
= 
Free endings Lee Zo 
! a 
4 os 
ro) 
& 
Epidermis Hair follicle i ie * * * 
OSS OSS 2 
; : SEEK SSF sor 
é 
ee Fe f 
Hair folicle 
Poud4f2kocreywr- Poudf2kocreyKo(cr)- 
J Pou4fav™t Poudf3kornKo CGRP endings K AAV-CAG:FLEX-GFPP"4 1. AAV-CAG:FLEX-GFPP"4!¥. Lanceolate endings 
E 1 
1 o 14 
< 
2 2 _2 * §8 i 
58 + Go 53 
£3 oo ao 
fc t 38 ‘Bes 
as fad ao 
rar os [ram 
Sa Ory oF 
o8 3 oo 
0 2 8. 
g@ o> # ONS 
aa ge g¢ 
23 58 8 3 
‘ 5 $3 
HaiMYeticle’ 3 Ey 


RNA-FISH 


= 
2 
u 
g 
2 
a 


Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | Generation and validation of Bmprib™*“ and 
Avuprila™*“ mouse lines. a, Targeting strategy for inserting a T2a-Cre-TGA"'? 
codon: Frt-PGK:Neo®-pA-Frt cassette immediately upstream of the stop codonin 
Bmprib.b, smRNA-FISH for both Bmprib and GFP in Bmprib™“° AAV-CAG:FLEX- 
GFP" mice to confirm the specificity and utility of the Bmprlb™*“ allele.c, 
Targeting strategy for inserting a T2a-Cre-TGAS'?°°o™; Frt-PGK:Neo®-pA-Frt 
cassette immediately upstream of the stop codon in Aupria. d, smRNA-FISH for 
both Aupria and tdTomato in Avpria’™*°@% Rosa26'"t4IomateT mice to 
confirm the specificity and utility of the Avprla”*“ allele. e, Top left, t-SNE 
representation of transcriptionally mature DRG overlaying the expression 
pattern of Avpria. Remaining images, representative immunostaining images of 
tdTomato and CGRP in skin sections obtained from Avprla™* Rosa26'5"tomato 
animals. f, Top left, t-SNE representation of transcriptionally mature 

DRG overlaying the expression pattern of Bmprib. Remaining images, 
representative immunostaining images of GFP and CGRP inskin sections 
obtained from Bmpr1b™*“° AAV-CAG-FLEX-GFP"*"" animals. g, Representative 


immunostaining images of GFP in skin sections obtained from Pou4f2©,A AV- 
CAG:FLEX-GFP**"” animals. h, Quantification of ending morphology for CGRP-« 
(Aupria™*“*Rosa26's"470m°) and CGRP-n (Bmpr1b™*“°AAV-CAG:FLEX-GFP"#"") 
somatosensory neuron subtypes, as well as Pou4f2 subtypes. i, Schematic 
representation of the skin with the distinct morphological ending types of 
CGRP-a and CGRP-n neurons displayed, as well as Pou4f2 subtypes. 

j, Representative images of CGRP immunostaining in skin samples from 
2-3-week-old Pou4f3"”"" (left) or Pou4f3* (right) littermate controls. 
*P<0.01, two-tailed t-test. k, Representative images of GFP immunostaining in 
skin samples from 3-4-week-old Pou4f2’°o” (top left) or Pou4 far ceKocre 
(right) littermates; representative RNA-FISH for GFP in Pou4f2°" and 
Pou4f2korevkoCre) littermate controls are displayed below the skin 
immunostaining images. *P< 0.01, two-way ANOVA with Tukey’s HSD post-hoc 
analysis (h); two-sided f-test (j,k). Bar graphsinh,j,k show mean+s.e.m.Forn 
values, see Methods. 
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Extended Data Fig. 9 | Subtype-restricted TF expression profiles inNgf” 
Bax cellclusters.a, Heat map depicting expression of the subtype-restricted 
TFsin PO somatosensory neuron subtypes (left) and clusters from Ngf’ Bax” 
mutants (right). b, smRNA-FISH for pairs of subtype-restricted TFs in Bax” 
(top) or littermate Ngf/ Bax’ mutants (bottom). c, Quantification of the 
smRNA-FISH data showing the number of Pou4f3/Shox2 double-positive, 
Pou4f3/Hopx double-positive, Bcl11a/Hopx double-positive, Neurod] single- 
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Neurod6 
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positive or Neurod6 single-positive neurons. d, Schematized model of 

gene expression programs as cells traverse development milestones. 
Transcriptionally unspecialized sensory neurons that emerge from Sox10* and 
Neurogi’ progenitors co-express multiple TFs, which become restricted to 
select subtypes as neurons mature. These TFs are responsible for establishing 
the transcriptional specializations found in each neuronal subtype. c, *P< 0.01, 
two-sided t-test. Forn values, see Methods. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Basespace (Illumina) was used to handle bcl files extracted from NextSeq500 reads. Behavioral experiments were conducted using 
custom codes written in Bonsai and Arduino software, which are available upon request. 


Data analysis STITCH/SPRING analysis was implemented in Matlab and underlying code has been previously published and available on github, The 
CellRanger (v3.0.0) pipeline was also used to map scRNA-seq and generate Count tables data (10x Genomics). Monocle 3 and Seurat (v2 
and v3) were used with standard commands in the R environment to further analyze data and generate heatmaps and gene expression 
plots 
and generate plots for scRNA-seq data, ImageJ (Fiji) was used to analyze confocal images 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


All data is available online via https://kleintools.hms.harvard.edu/tools/springViewer_1_6_dev.html?datasets/Sharma2019/all . All raw read count tables for each 
time point is available via the gene expression omnibus. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 
DX] Life sciences [_] Behavioural & social sciences [_] Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No sample size pre-determination was performed. The largest number of cells that could be practically sequenced from an mice were done in 
all conditions. This study does not claim to saturate the discovery of all possible cell types nor saturate the detection of all possible difference. 
However, given the number of cell sequenced, we were able to confidently characterize cell types based on previously published literature as 
well as generate new Cre drivers that labeled morphologically homogenous cell types. This suggests that the number of cells sequenced is 
sufficient to support the claims of the study. 


Data exclusions _ Individual cells with high mitochondrial read count or low gene discovery were excluded from further analysis and predetermined based on 
standards previously determined - a common practice for single cell studies. The number of cells excluded were roughly equal across all time 
points and experimental conditions. 


Replication All replication attempts were successful by our best efforts. To validate scRNA-seq experiments we performed strategic in situ validation. To 
validate the faithfulness of the mouse reporter lines, we performed double in situs with the reporter and the endogenous gene driving Cre to 
confirm that the cell type of interest was properly labeled. 


Randomization = Animals used for behavior were litter mates (test and control groups) and randomly selected. All time points for scRNA-seq were performed in 
random orders. 


Blinding Blinding was not possible in our study as all conditions were sequenced in parallel so the investigator could not be aware of the results until 
after completion of the experiment. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 

T | Antibodies |] ChiP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Rabbit Anti-NeuN, Millipore: MAB377, 1:1000. Goat Anti-mCherry/tdtomato, CederLane: ABO040-200, 1:1000, Chicken Anti-GFP, 
Aves: GFP-1020, 1:1000, Rabbit Anti-GFP, ThermoFisher: A6455, 1:1000, Rabbit Anti-CGRP, Immunostar: 24112, 1:1000. 
Secondary antibodies included Alexa 488 or 546 conjugated goat anti-rabbit antibodies, goat anti-chicken antibodies, All 
secondary antibodies were purchased from Life Technologies except for Alexa 488 conjugated donkey anti-chicken antibodies, 
which was purchased from Jackson ImmunoResearch, and all were used at 1:500 dilution 


Validation All antibodies were validated in previous studies from our lab and others. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) 293T cells (ATCC) 


Authentication Cells lines were not authenticated in this study - they were only used as a tool to generate adeno-associated virus 


= 
jad) 
ou 
e 
= 
o 
= 
o 
Za) 
© 
je’) 
= 
a 
= 
= 
io 
ne) 
2) 
ee 
= 
Co) 
Za) 
S 
3 
= 
fev) 
= 
< 


Mycoplasma contamination Cell lines were not tested for myco contamination 


Commonly misidentified lines No commonly misidentified cell lines were used 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals All data presented used mice on the C57BI/6N or C57BI/6J background. When possible littermate controls were used. Most 
animals were used in the 1-6 week old range, with a few labeled exceptions. 
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Wild animals This study did not involve the use of wild animals 
Field-collected samples This study did not involve the use of wild animals 
Ethics oversight Harvard Medical School |ACUC 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Alzheimer’s disease is an incurable neurodegenerative disorder in which 
neuroinflammation has acritical function’. However, little is known about the 
contribution of the adaptive immune response in Alzheimer’s disease’. Here, using 
integrated analyses of multiple cohorts, we identify peripheral and central adaptive 
immune changes in Alzheimer’s disease. First, we performed mass cytometry of 
peripheral blood mononuclear cells and discovered an immune signature of 
Alzheimer’s disease that consists of increased numbers of CD8* T effector memory 
CD45RA* (Teva) Cells. Ina second cohort, we found that CD8* Tryp, cells were 
negatively associated with cognition. Furthermore, single-cell RNA sequencing 
revealed that T cell receptor (TCR) signalling was enhanced in these cells. Notably, by 
using several strategies of single-cell TCR sequencing in a third cohort, we discovered 
clonally expanded CD8* Typ, Cells in the cerebrospinal fluid of patients with 
Alzheimer’s disease. Finally, we used machine learning, cloning and peptide screens 
to demonstrate the specificity of clonally expanded TCRs in the cerebrospinal fluid of 
patients with Alzheimer’s disease to two separate Epstein-Barr virus antigens. These 


results reveal an adaptive immune response in the blood and cerebrospinal fluid in 
Alzheimer’s disease and provide evidence of clonal, antigen-experienced T cells 
patrolling the intrathecal space of brains affected by age-related neurodegeneration. 


Neuroinflammation is a pathological hallmark of Alzheimer’s disease 
(AD). Although much effort has been dedicated to understanding 
innate inflammation in AD, little is known about the adaptive immune 
response. The lymphatic system of the brain carries immune cells from 
the cerebrospinal fluid (CSF) and connects to the deep cervical lymph 
nodes’, enabling peripheral T cells to respond to brain antigens. How- 
ever, whether T cells enter the brain to perpetuate neuroinflammation 
in AD is unknown. 

Interaction between the T cell receptor (TCR) and antigen presented 
by the major histocompatibility complex (MHC) is critical to adap- 
tive immunity. When T cells recognize cognate antigen, they clonally 
expand‘. TCR sequences are so diverse that they are essentially unique 
to an individual T cell. Thus, finding two or more T cells with the same 
TCR sequence is evidence of clonal expansion®. Several small studies 
have reported changes in the distribution®®, function and cytokine 


secretion of peripheral T cells!©” in AD (Supplementary Table 1), but 
the antigens that drive these changes are unknown. 

We integrated analyses of multiple cohorts and used several meth- 
ods to assess adaptive immunity in AD (Fig. 1a). First, we used mass 
cytometry to study peripheral blood mononuclear cells (PBMCs) from 
patients with AD and patients with prodromal mild cognitive impair- 
ment (MCI) (cohort 1; Fig. 1a, Supplementary Table 2). We age-matched 
patients to cognitively typical, healthy control individuals (Extended 
Data Fig. 1a). In addition, we confirmed diagnoses as MCI or AD by: (1) 
reduced cognitive scores (Extended Data Fig. 1b); (2) reduced ratios 
of amyloid-f (AB):phosphorylated tau and AB:total tau within the CSF 
(Extended Data Fig. 1c, d); and (3) volumetric loss of brain regions as 
measured by magnetic resonance imaging (MRI) (Extended Data 
Fig. le). We developed a panel of immune markers (Supplementary 
Table 3) that enabled major subsets of PBMCs to be identified (Extended 
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Fig. 1| Peripheral CD8* T,.,,, cells are increased in AD and are negatively 
associated with cognition. a, Four cohorts were used to assess adaptive 
immunity in AD. b, Representative SPADE trees of PBMCs from healthy 
individuals and patients with MCI or AD in cohort 1 show anincreased 
abundance of a CD8’ cluster (cluster 63) in patients with MCI or AD. Background 
tree nodes are sized according to cell counts. Insets are coloured according to 
CD8 expression. c, Quantification of cluster 63 as a percentage of total PBMCs. 
The percentage of cluster 63 cells is significantly higher in patients with MCI or 
ADthan healthy control individuals. Mean +s.e.m.; unpaired two-sided t-test 
with Welch’s correction. d, Marker expression analysis of cluster 63 
corresponds toa CD3°CD8*CD45RA‘*CD27 Tera population. Datainc, d were 
pooled from seven independent experiments with similar results. e, Linear 
regression showing the inverse correlation between cognitive score and the 
percentage of CD8* Tera cells in individuals from cohort 2. Pearson’s 
correlationrvalues are shown for each group. The significance of the difference 
between the two data sets was measured by ANCOVA. f, Stimulation with PMA 
andionomycin (stim.) induces increased expression of IFN-y in CD8°T cells 
from patients with MCI or AD. Mean +s.e.m.; unpaired two-sided t-test with 
Welch's correction. g, Differential expression analysis (scRNA-seq) of CD8* 
Temra cells from healthy individuals (n=7) and patients with MCI or AD (n= 6) 
shows upregulated TCR signalling. Model-based analysis of single-cell 
transcriptomics (MAST) differential expression test with Benjamini-Hochberg 
correction. h, Pathway analysis of differentially expressed genes in CD8" Tempra 
cells from patients with MCI or AD (n=6 subjects) versus healthy individuals 
(n=7 subjects) shows increased antigenic stimulation of CD8* Typ, cellsin 
patients with MCI or AD. Fisher’s exact test with Benjamini-Hochberg 
correction. Pathways (circles) with positive z-scores are coloured red; those 
with negative z-scores are coloured blue. The size of the circle corresponds to 
the size of the z-score (two-sided). 


Data Fig. 2). We then used spanning-tree progression analysis of den- 
sity-normalized events (SPADE) to perform unsupervised clustering 
(Extended Data Fig. 3a). Notably, we detected an increase in a popula- 
tion of CD8’ cells in patients with MCI or AD (cluster 63; Fig. 1b). Plotting 
all SPADE clusters for P value versus fold change, the cluster that was 
most highly increased among patients was cluster 63 (Extended Data 
Fig. 3b). Quantification of individual subjects revealed higher values 
for this cluster in patients with MCI or AD than controls (Fig. 1c). Finally, 
marker expression for this cluster corresponded to CD3*CD8*CD27 T 
effector memory CD45RA* (Tera) Cells (Fig. 1d)—a T cell population 
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with potent effector functions that include the secretion of proinflam- 
matory cytokines and cytotoxic molecules”. 

We next used cluster identification, characterization and regres- 
sion (CITRUS) to determine whether clusters could predict disease 
status. Notably, CITRUS identified a significantly altered cluster (arbi- 
trarily numbered 229992) that corresponds to CD3*CD8*CD45RA* 
T cells (Extended Data Fig. 3c). Quantification revealed higher per- 
centages of this population in patient PBMCs than controls (Extended 
Data Fig. 3d). Moreover, marker expression again pointed to Tryra 
cells (Extended Data Fig. 3e). Next, we used a regularized supervised 
learning algorithm from CITRUS to determine populations of T cells 
that predicted whether a sample belonged to healthy groups or those 
with disease. Notably, cluster 229992 combined with nine additional 
clusters was 80% predictive of disease (Extended Data Fig. 3f, Supple- 
mentary Table 4). Next, we used our mass cytometry dataset to derive 
41immune variables, which showed differences between patients and 
healthy controls in relation to CD8* T cells (Supplementary Table 5). 
Specifically, we detected an increase in CD8° T cells in patients versus 
controls (Extended Data Fig. 4a), with a concomitant decrease in the 
ratio of CD4* to CD8" T cells (Extended Data Fig. 4b). Subsets of CD8* 
Tcells were also altered in PBMCs from patients with MCI or AD: effector 
cells were overrepresented (Extended Data Fig. 4c), whereas memory 
cells were underrepresented (Extended Data Fig. 4d). Together, these 
results demonstrate an adaptive immune signature of AD that consists 
of increased peripheral CD8* Tyypa Cells. 

To further investigate the role of CD8* T cells in MCI and AD, we 
assessed the relationship between cognition and populations of mem- 
ory T cells ina separate cohort (cohort 2) (Fig. 1a, Extended Data Fig. 5a). 
This revealed a negative correlation in MCI and AD between CD8° Trwpa 
cells and cognition (Fig. le), whereas percentages of T central memory 
(Tc) and T effector memory (T;y,) cells were positively correlated 
(Extended Data Fig. 5b). Age did not influence the levels of CD8* Trpa 
cells in either group (Extended Data Fig. 5c). We next tested whether 
functional differences exist in the peripheral CD8* T cells of patients 
with MCI or AD. We stimulated PBMCs with phorbol 12-myristate 
13-acetate (PMA) and ionomycin, and then measured the levels of two 
effector cytokines: interferon-y (IFN-y) and tumour necrosis factor 
(TNF) (Extended Data Fig. 5d). After stimulation with PMA, CD8’ T cells 
from patients with MCI or AD had significantly higher levels of IFN-y 
than did control CD8*' T cells (Fig. 1f), and TNF also trended towards 
significance (Extended Data Fig. 5e). Overall, these data identify an 
antigen-experienced population of CD8* Try, cells that has potent 
effector functions—including the ability to secrete proinflammatory 
cytokines—in the peripheral immune system of patients with MCI or AD. 

The proinflammatory effector phenotype of peripheral antigen- 
experienced CD8* Typ, cells in MCI and AD prompted us to test whether 
CD8* Tempra Cells from patients with MCland AD weretranscriptionally 
distinct. We performed droplet-based single-cell RNA sequencing 
(scRNA-seq) on sorted CD8* Typ, cells from populations of PBMCs 
from patients with MCI or AD and control individuals (Extended Data 
Fig. 6a). Differential expression and pathway analysis revealed a sig- 
nificant decrease in amino acid metabolism and an increase in TCR 
and cytokine signalling in CD8* Tega cells in MCI and AD (Fig. 1g, h, 
Extended Data Fig. 6b, c). Collectively, these results indicate greater 
antigenic stimulation of peripheral CD8* Teypa cells in patients with 
MCland AD than controls. 

We next sought to determine whether CD8* T cells were present in 
the brain of patients with AD. We assessed post-mortem brains froma 
third cohort (cohort 3) comprising control individuals (no neurologi- 
cal disease) and patients with AD (Supplementary Table 6) by using 
immunohistochemistry to examine the expression of CD8 and AB and 
analyse the proximity of CD8* T cells to the cerebral vasculature. We 
noted numerous extravascular CD8* T cells in the perivascular space 
of blood vessels with cerebral amyloid angiopathy in the hippocampi 
of three AD brains (Fig. 2a); by contrast, CD8* T cells were not observed 
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Fig. 2|CD8* T cells enter the brain in patients with AD. a, Confocal imaging of 


cerebral amyloid angiopathy (CAA) inthe post-mortem brain of a patient with 
AD from cohort 3 shows CD8*T cellsin the perivascular space of AB* blood 
vessels with cerebral amyloid angiopathy in three AD-affected hippocampi. 
Arrowheads indicate CD8’ T cells; asterisks indicate blood vessel lumen. Scale 
bars, 20 pm. b, Higher numbers of CD8* T cells were detected in AD-affected 
than control hippocampi. Mean +s.e.m.; unpaired two-sided t-test with Welch’s 
correction.c, A CD8* T cell is shown associated with MAP2* neuronal processes. 
d, CD8* T cells are localized to the leptomeninges and adjacent to hippocampal 
AB plaques. Scale bar, 100 pm. Datainc, d were replicated in three independent 
experiments. 


surrounding blood vessels in control brains (Extended Data Fig. 7a). 
Notably, we detected CD8*' T cells adjacent to AB plaques (Extended 
Data Fig. 7b-d). We also detected significantly more CD3*CD8* T cells 
in AD-affected hippocampi than control hippocampi (Fig. 2b, Extended 
Data Fig. 7e). CD8* T cells associated with microtubule associated 
protein 2 (MAP2)* neuronal processes in AD-affected hippocampi 
(Fig. 2c). We also noted an association of CD8* T cells with neurofila- 
ment heavy (NEFH)* neuronal processes in an APP/PS1 mouse model of 
AD (Extended Data Fig. 7f), and confirmed this association by electron 
microscopy (Extended Data Fig. 7g). Finally, we also detected CD3*CD8* 
T cells in the leptomeninges adjacent to the hippocampus in brains 
from patients with AD (Fig. 2d). 

The localization of CD8* T cells to brain leptomeninges led us to 
investigate whether antigen-specific cells also patrol the CSF. CSF 
immunity is relatively uncharacterized in healthy elderly individu- 
als", Therefore, we enumerated immune cells in CSF from ten healthy 
elderly subjects (Extended Data Fig. 8a). We found that CSF from these 
individuals contained mostly T cells, with a minority of innate immune 
cells and an undetectable amount of B cells (Extended Data Fig. 8b). 
Notably, the CD8* T cell repertoire of CSF from elderly individuals 
was composed almost exclusively of T,,, cells, and Tyga cells made up 
around 20% of this T,,, population (Extended Data Fig. 8c). 

AS Tempra Cells are associated with immunological memory, we inves- 
tigated whether clonally expanded T cells patrol the CSF in healthy 
individuals, patients with AD and patients with Parkinson’s disease 
(PD; used as a control because antigen-specific T cells were recently 
discovered in vitro in PBMC samples from patients with PD”) ina sepa- 
rate cohort (cohort 4; Fig. 3a, Supplementary Table 2). We used two 
methods toassess clonal expansion: plate-based sequencing (plate-seq) 
and droplet-based sequencing (drop-seq). We first performed plate- 
seq of TCRs from CSF T cells of healthy controls (n =5), patients with 
AD (n=3) and patients with PD (n=2) (Extended Data Fig. 8d). Clonally 
expanded cells were mostly CD8' T cells, although we also detected 


CD4' T cell clones in nearly all subjects (Extended Data Fig. 8e, Sup- 
plementary Table 7). Strikingly, one patient with AD had a massively 
expanded clone that comprised 44% of all CD8* TCRs (Fig. 3b). Analysis 
of marker expression identified this clone as CD8*CD45RA*CD27 Tryin 
cells (Fig. 3c). Together, these results provide the first evidence (to our 
knowledge) that clonally expanded CD8° T cells patrol the CSF in brains 
that are affected by age-related neurodegeneration. 

To examine potential changes in gene expression in clonal CD8* 
T cell populations in AD, we sorted live cells of the CSF (Extended Data 
Fig. 8f) and used drop-seq to validate our plate-seq findings. Multidi- 
mensional reduction and visualization with t-distributed stochastic 
neighbour embedding (t-SNE) showed that most CSF cells were T cells 
(Fig. 3d, e). This cellular and molecular characterization of the CSF in 
elderly individuals and patients with AD indicates that the intrathecal 
immune compartment contains almost exclusively T cells of the T,;., 
subtype. Clusters were composed of a mixture of groups, sexes and 
patients (Extended Data Fig. 8g). We next performed single-cell TCR 
sequencing (scTCR-seq) on these cells, which showed that clonal T cells 
colocalized with CD8*' T cells (Fig. 3f). We observed numerous highly 
expanded CD8*' clones in CSF from patients with MCI or AD, whereas 
this expansion occurred less frequently in CSF from control individu- 
als (Fig. 3g). Quantification of the most highly expanded (maximum) 
clone for each subject revealed that the percentages of the maximum 
clone were higher in patients with MCI or AD than in control individu- 
als (Fig. 3h). Differential expression analysis of clones that were highly 
expanded in MCl and AD (that is, more than five T cells with the same 
TCRafB sequence) revealed increased expression of cytotoxic effec- 
tor genes, including natural killer cell granule protein 7 (NKG7) and 
granzymes A, H and K (GZMA, GZMH and GZMK) (Fig. 3i). We then 
quantified the percentages of T cells that corresponded to highly 
expanded clones, which showed that 49.13% were CD8* Temp, Cells 
(Fig. 3j). Differential expression analysis of clonally expanded CD8* 
Tera Cells also revealed increased expression of cytotoxic effector 
genes including NKG7 and GZMA in MCl and AD (Fig. 3k). Notably, 
CD8* Tempra Cells from patients with MCI or AD also expressed higher 
levels of the MHC genes human leukocyte antigen C (HLA-C) and 
beta-2-microglobulin (B2M)—a pro-ageing factor that impairs cogni- 
tion’’—compared with control cells (Fig. 3k). In addition, differential 
expression of highly expanded clones from patients with MCI or AD 
showed that these clones were CD8-positive and that they upregulated 
their expression of cytotoxic granules, proteases and proinflammatory 
cytokines (Extended Data Fig. 9a), many of which are associated with 
AD (Supplementary Table 8). Although it is not clear whether cells enter 
the brain parenchyma via the CSF, we localized GZMA expression to 
CD8*° T cells (Fig. 31) and detected higher percentages of GZMA*CD8* 
cells in hippocampi from patients with AD than in hippocampi from 
control individuals (Fig. 3m). Collectively, these results reveal the pro- 
inflammatory, cytotoxic function of clonal CD8* Tyypa Cells in the CSF of 
patients with AD, and demonstrate the utility of combining scRNA-seq 
and scTCR-seq datasets. 

We next pooled TCRaf sequences from plate-seq and drop-seq 
experiments in cohort 4 to broadly assess clonality in neurodegen- 
eration. We first separately analysed the maximum clones (defined as 
comprising 3% or more of all TCRaB sequences) from healthy subjects 
and patients with MCI, AD and PD, which revealed a highly expanded 
clonein only one out of ten healthy subjects, compared with four out of 
six patients with AD, two out of six patients with PD and one out of five 
patients with MCI (Extended Data Fig. 9b). We next measured overall 
clonality (defined as the percentage of total TCRafs sequences that are 
identical to one or more TCRaf sequences), which revealed that the 
levels of clonality were highest in AD and PD (Extended Data Fig. 9c). 
Notably, when we integrated CSF cells from patients with PD into our 
analysis (Extended Data Fig. 9d), we observed increased expression 
of genes analogous to those observed in AD clones (Extended Data 
Fig. 9e). Together, these results demonstrate the clonal expansion of 
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Fig. 3 | Clonal expansion of CD8* T,y,, Cells in the CSF of patients with AD. 

a, Plate-seq and drop-seq methods used for scTCR-seq and scRNA-seq of 
immune cells of the CSF in patients from cohort 4. b, CD8* TCRaf clonality 
(plate-seq) in the CSF of patients with AD and healthy control individuals. 

c, The top (most expanded) clone in AD had a marker expression profile of 
CD8*CD45RA‘*CD27  Trmpa Cells. Data were replicated in two independent 
experiments. d, CSF cells analysed by drop-seq and clustered by 
multidimensional reduction with t-SNE, showing populations of immune cells 
that include CD8* Tyypa cells (n = 9 healthy control individuals (10,876 cells); 

n=9 patients with MCl or AD (10,391 cells)). e, Marker expression of CSF 
clusters, including CD8* Teypa cells. CD62L is also known as SELL; CD1Icis also 
known as/7TGAX. Data were pooled from three independent experiments. 

f, Concentration of clonal cells in locations of CD8’ T cell clusters (n=9 subjects 
per group). g, Representative plots of CD8* TCRaf clonality (drop-seq) in 
age-matched subjects shows enhanced clonal expansion and more highly 
expanded clones in AD. Clones are coloured by proportion of the total TCRaB 
sequences. h, Quantification of maximum clones (% TCRaB sequences) shows a 


cytotoxic proinflammatory CD8* Tyypq cells in the CSF of patients with 
age-related neurodegeneration. 

We next sought to determine the antigens that drive clonal expan- 
sion of CD8* Teypa Cells in the CSF of patients with MCI or AD. We cre- 
ated unweighted networks of our clonal TCRaB sequences to detect 
shared clones within and between groups. Although we did not identify 
shared TCRaB sequences in healthy subjects within or between groups, 
we detected a shared TCRaf clone between a patient with MCI anda 
patient with AD (Fig. 4a). Notably, a third patient with AD shared this 
TCRB sequence (CASSLGQAYEOQYF), which has known specificity for 
the Herpesviridae Epstein-Barr nuclear antigen 3 (EBNA3A)” (Fig. 4b). 
We then assessed the gene-expression profile of this Epstein-Barr virus 


402 | Nature | Vol577 | 16 January 2020 


higher percentage in patients with MCI or AD than healthy control individuals 
(n=9 subjects per group). Samples lacking clonal cells were scored as zero. Box 
plots show median and 25th to 7Sth percentiles, and whiskers indicate the 
minimum and maximum values. Unpaired two-sided t-test with Welch’s 
correction. i, Differential expression of highly expanded clones (clonal 

TCRaf > 5) revealed increased expression of cytotoxic effector genes. MAST 
differential expression test with Benjamini-Hochberg correction (n=9 
patients with MCI or AD).j, Quantification of highly expanded clones (clonal 
TCRaf > 5) showed that 49.13% of them are CD8* Typ, Cells (n =9 patients with 
MClor AD).k, Increased expression of B2M, NKG7 and GZMA in clonal CD8* 
Tcells from patients with MCl or AD. MAST differential expression test with 
Benjamini-Hochberg correction (n=9 subjects/group). I, A hippocampal CD8* 
Tcellinan AD-affected brain (cohort 3) shows expression of GZMA adjacent to 
MAP2‘ neuronal processes. Scale bar, 5 pm. Data were replicated in three 
independent experiments. m, Percentages of CD8* T cells that express GZMAin 
control and AD-affected hippocampi. Mean +s.e.m.; unpaired two-sided t-test 
with Welch’s correction. 


(EBV) EBNA3A-specific clone, and found that the expression of cyto- 
toxic effector genes was increased (Fig. 4c). 

To determine the antigen specificity of unknown TCRs, we applied 
HLA haplotyping within our GLIPH (grouping of lymphocyte interac- 
tions by paratope hotspots) algorithm” to our plate-seq TCRs. GLIPH 
can analyse large numbers of TCR sequences and define specificity 
groups that are shared by TCRs. Notably, GLIPH identified a shared 
TCRBP chain between patients with AD (CASSLAGGYNEQFF); this 
chain also shared homology with a TCRB chain from a third patient 
(CASSLGTGNNEOFF) (Fig. 4d, Supplementary Table 9). To test for 
antigen specificity of GLIPH-derived TCRs, we generated two cell 
lines—termed TCRaf 1 and TCRaf 2—that express TCRaB sequences 
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Fig. 4 | Antigen identification of clonally expanded TCRs in the CSF of 
patients with AD. a, Unweighted network analysis of CD8 TCRaB sequences 
combined from plate-seq and drop-seq experiments. Group node IDs with 
individual TCRaf clones are depicted as circles and sized according to the 
proportion of total sequences of eachclone. Arrow indicates a shared clonal 
TCRaf sequence with specificity for EBV EBNA3A. Note that several healthy 
control (HC) subjects have no clones. b, Shared TCRaB sequences among 
patients with MCI or AD. Three patients had identical TCRB chains with 
specificity for EBV EBNA3A. The antigen specificity of TCRB is shown below”. 
c, Differential expression of EBV-specific clones in MCl and AD (fromn=3 
subjects) versus all CSF T cells shows enhanced expression of cytotoxic 
effector genes. MAST differential expression test with Benjamini-Hochberg 
correction. d, Workflow for antigen identification of CSF TCRs. GLIPH was 
applied to TCR sequencing to derive homologous TCR sequences between 
patients. GLIPH identified two patients with AD who had identical TCRB chains 


from these two patients with AD (patients 1 and 2) (Fig. 4d, Extended 
Data Fig. 10a). Both TCRaf 1 and TCRaf 2 cells showed upregulation 
of the activation marker CD69 when stimulated in a cell-free manner 
(Extended Data Fig. 10b). We then used dermal fibroblasts from patients 
land 2 as autologous antigen-presenting cells to screen a pool of 80 
known MHC-I restricted epitopes (Supplementary Table 10) for reac- 
tivity. Notably, TCRaB 1 cells responded to the MHC-I peptide pool, 
whereas TCRof 2 cells were non-responsive (Fig. 4e, Extended Data 
Fig. 10c). TCR activation depended on autologous antigen presentation, 
as non-autologous presentation did not stimulate TCRaf 1 cells—dem- 
onstrating MHC restriction (Extended Data Fig. 10d). 

We then narrowed this MHC-I pool to 15 peptides that could activate 
TCRaf I cells, by filtering for peptides restricted to an HLA that matched 
patient 1 (HLA-A*01:01 or HLA-B*08:01) (Supplementary Table 11). 
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anda third patient with a similar sequence. The TCRaf sequences derived from 
GLIPH were introduced into SKW-3 cells. e, Autologous fibroblasts were used to 
present antigens to TCRaB1and TCRaf 2 cells. Only TCRaB 1 cells showed 
significant upregulation of CD69 following antigen presentation. Data are 
averages from three separate experiments performed intriplicate. 

Mean +s.e.m.; one-way analysis of variance (ANOVA) (F(3, 8) =1,050, 

P=1,01x 107°) with Tukey’s multiple comparisons test. f, Peptide 7 (RAKFKQLL) 
of the EBV trans-activator BZLF1 protein activates TCRaf 1 but not TCRap 2 
cells. Two-way ANOVA (F(14, 60) =14.06, P=4.8 x 10 “) followed by Sidak’s 
multiple comparisons test. Data were pooled fromn=3 independent 
experiments. The Pvalue shown is from comparing peptide 7 values for each 
cellline. Mean+s.d.g, A fluorescent dextramer composed of HLA-B*08:01 
presenting the BZLF1 peptide RAKFKQLL shows nearly 100% positivity with 
TCRaf 1 but no positivity with TCRaf 2 cells. Unpaired two-sided t-test with 
Welch's correction (n=6 per group). 


We used TCRaf 2 as a control, given the sequence similarity of 
its B chain, separate patient HLA and inability to become activated 
by the MHC-I peptide pool. Only one peptide activated TCRaf 1 
cells: peptide 7-RAKFKQLL of the EBV trans-activator protein BZLF1 
(Fig. 4f, Extended Data Fig. 10e). To verify the antigenicity of BZLF1 
to TCRaf 1 cells, we designed a dextramer to present RAKFKQLL on 
HLA-B*08:01 (Fig. 4g). Notably, the dextramer bound to TCRaf I cells 
with 100% positivity, whereas TCRa® 2 cells did not bind (Fig. 4g, 
Extended Data Fig. 10f). These results demonstrate BZLF1 peptide 
RAKFKQLL as a cognate antigen presented by HLA-B*08:01 fora previ- 
ously undescribed TCR (TCRa: CAASEGGFKTIF; TCRB: CASSLGTGN- 
NEQFF).Insummary, our TCRaf cloning strategy led to the discovery 
ofanovel TCRin AD, with specificity for the EBV trans-activator protein 
BZLF1. 


Nature | Vol577 | 16 January 2020 | 403 


Article 


Overall, these results demonstrate antigen-specific clonal expansion 
of CD8*T cells in AD and signify the need for a greater understanding 
of the role of adaptive immunity in this disease. Although we detected 
EBV-specific TCRs in CSF from patients with AD, we caution that these 
data are not evidence of a causal link between EBV infectivity and AD. In 
fact, the EBV-specific clones detected in this study were not the most 
highly expanded and were not enriched among patients. However, it 
remains tenable that the most highly expanded clones observed in 
patient CSF are specific for non-self antigens that were not identified 
in our search. Several different antigens may activate and promote 
CD8* T cell trafficking in AD, with a potential effect on neurodegenera- 
tion through their cytotoxic effector function. Identifying the self and 
non-self antigens that clonal TCRs recognize in amyloidogenic brain 
disorders suchas AD and PD will provide anew approach to investigate 
adaptive immunity in neurodegenerative disease. 
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Methods 


Study participants 

Samples were acquired through the Stanford Brain Rejuvenation Pro- 
gram, the NIA funded Stanford Alzheimer’s Disease Research Center 
(ADRC), the University of California at San Francisco ADRC and the 
University of California at San Diego ADRC. Collection of brain tissue, 
plasma, PBMCs and CSF was approved by the Institutional Review Board 
of each university, and written consent was obtained from all subjects. 
A total of 164 living subjects from three separate cohorts (cohorts 1, 
2 and 4) were used in this study. The 164 subjects included 97 healthy 
individuals, 31 patients with MCI, 28 patients with AD and 8 patients 
with PD. The average age of subjects was 72.52 + 6.96 (mean +s.d.) for 
healthy individuals; 70.97 + 7.82 for patients with MCI; 70.74 + 7.01 for 
patients with AD; and 67.25 + 7.01 for patients with PD. The average 
cognitive score was 27.17 + 2.32 for healthy individuals; 23.39 + 4.31 for 
patients with MCI; 11.05 + 7.32 for patients with AD; and 27.29 +1.70 for 
patients with PD. All subjects were free from acute infectious diseases 
and in good physical condition. Patients were confirmed to have neuro- 
degeneration by measurement of biomarkers including neurofilament, 
AB and tau (Quanterix). Group characteristics, including demographic, 
genetic, clinical and biomarker data for each group, are presented in 
Supplementary Table 12. 


Tissue collection 

PBMCs were isolated from blood by layering diluted blood (1:1in PBS) 
on top of an equal volume of Ficoll, followed by centrifugation and 
isolation of the buffy coat. CSF was collected by lumbar puncture, 
then centrifuged at 300g to pellet immune cells. CSF samples were 
checked for blood contamination by resuspending the pelleted cells 
in100 pl CSF and mixing 10 pl (10%) CSF with 10 pl trypan blue to assess 
red blood cell content and viability. Cells were visualized on a TC20 
automated cell counter (BioRad) and cell viability and the presence 
or absence of red blood cells was recorded. CSF samples that were 
contaminated with blood were not used in the study. The resuspended 
cells were then mixed with 900 I recovery cell culture freezing medium 
(Thermo Fisher Scientific). Allsamples were frozen overnight at -80 °C 
inaMr. Frosty freezing container (Thermo Fisher Scientific) and trans- 
ferred the following day to liquid nitrogen for storage. PBMC samples 
were stored over the course of a three-year period. The average PBMC 
viability for patient samples following sample thawing was 79%, and 
that for control samples was 77%. CSF samples were stored less than 6 
months before analysis. 


Cognitive testing 

Study subjects underwent a battery of neuropsychological assessments 
to determine group status, including cognitive examination, evalua- 
tion of cerebellar function, deep tendon reflexes, sensory input and 
motor function. The Montreal Cognitive Assessment (MoCA)” exami- 
nation was used to test study subjects for cognitive impairment. The 
MoCA assesses several cognitive domains: short-term memory recall 
(5 points), visuospatial abilities (4), executive functions (4), attention 
(1), concentration (3), working memory (1), language (6) and orientation 
to time and space (6). MoCA scores range between 0 and 30. 


Structural brain MRI 

T1-weighted MRI scans were acquired using an Axial 3D fast spoiled 
gradient sequence (GE Discovery 750). The imaging parameters were 
optimized for grey and white matter tissue contrast with a repetition 
time of 5.9 ms, echo time 2 ms, flip angle 15°, field of view 220 mm, 
matrix size 256 x 256, slice thickness 1 mm and 2 NEX. Image analysis 
was conducted using FreeSurfer 6.0. For subcortical segmentation, 
we used the ‘recon-all’ command. For hippocampal segmentation, we 
appended the flag ‘-hippocampal-subfields-T1’ to the ‘recon-all’ com- 
mand for each patient. To correct for sex differences, we normalized 


all volumetric measurements to the total intracranial volume for each 
patient. 


Flow cytometry 

Flow cytometry was conducted using an LSRFortessa (BD Biosciences). 
A panel consisting of antibodies conjugated to six different fluoro- 
phores was used to classify subsets of memory T cells and for drop-seq. 
Antibodies used were: CD8a-Pacific blue (BioLegend), CD3-BV650 
(BD Biosciences), CD45RA-APC-Cy7 (BioLegend), CCR7-488 (Bio- 
Legend), IL-7Ra-PE (BioLegend) and CD27-PE-Cy7 (BioLegend). For 
characterization of CSF cells, this same panel was used, but CD19- 
PE-Cy5 (BioLegend) and CD14-Qdot-705 (Thermo Fisher Scientific) 
were included. For sorting CSF T cells for TCR plate-seq, the following 
antibodies were used: CD8a-PE (BioLegend), CD161-PE-Cy7 (BioLeg- 
end), CXCR3-APC (BioLegend), CD4-APC-Alexa700 (Thermo Fisher 
Scientific), CD39-APC-Cy7 (BioLegend), CD38-FITC (BioLegend), PD- 
1-BV421 (BD Biosciences), CD45RA-BV605 (BD Biosciences), CD3-BV650 
(BD Biosciences), CD27-BV786 (BD Biosciences) and CD127-BUV395 
(BD Biosciences). For each experiment, a compensation matrix was 
developed using singly stained and unstained controls or fluorescent 
beads, and all analysis was conducted in Cytobank. 


Cell stimulation 

PBMCs were thawed and plated at a density of 1 x 10° cells per well in 
a 24-well plate. The medium consisted of RPMI with 10% fetal bovine 
serum and 1x penicillin-streptomycin. After overnight incubation at 
37 °C with 20 U mI IL-2, cells were stimulated witha cocktail containing 
PMA, ionomycin and brefeldin A (BioLegend). Cells were then incu- 
bated an additional 5 h before performing intracellular flow cytometry 
analysis. 


Mass cytometry 

Mass cytometry was performed as previously described”. In brief, 
cells were thawed in complete medium containing RPMI with 10% fetal 
bovine serum (FBS), 1% penicillin-streptomycin and 0.1mg ml‘ DNase. 
After washing in Maxpar Cell Staining Buffer (Fluidigm) cells were resus- 
pended in50 pl filtered antibody cocktail (Fluidigm) and incubated for 
60 min on ice. Cells were again rinsed, then resuspended in 100 pl of 
1:3,000-diluted In115-DOTA maleimide in buffer. Following additional 
rinses, cells were resuspended in 100 ul of 2% paraformaldehyde in 
buffer and incubated at 4 °C overnight. Cells were then washed twice 
with permeabilization buffer (eBioscience) and incubated onice for 45 
min. After rinsing, cells were resuspended in Ir-Interchelator in buffer 
and incubated for 20 min at room temperature. Cells were then washed 
with buffer, then MilliQ water and finally resuspended in MilliQ water 
for running ona Helios mass cytometer. To normalize mass cytometry 
data, the same lot of EQ Four Element Calibration Beads (Fluidigm) was 
used for all experiments to control for signal variation that may have 
occurred inthe instrument over time. Normalization of mass cytometry 
data was achieved using the Matlab bead normalization tool. 


SPADE and CITRUS analyses 

SPADE and CITRUS analyses were conducted using Cytobank. For SPADE 
conducted on mass cytometry data, we established a target number of 
nodes of 100 for our immunophenotyping assay. We based this num- 
ber onempirical evaluation of results from multiple runs on the same 
dataset, which showed comparable results. The SPADE population for 
immunophenotyping (live CD45" cells) was selected on the basis of the 
gating strategy shown in Extended Data Fig. 3. Cells were clustered onall 
markers except those used to exclude platelets or endothelial cells. For 
CITRUS, the same population of cells and same clustering channels were 
used as in SPADE to quantify the abundance of various populations. 
Results are from 5 x 10? events per sample, with a false discovery rate 
of 2%, minimum cluster size of 1% and cross validation folds set to 10. 
The predictive nearest shrunken centroid PAMR association model was 
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used. To avoid spurious results, we ran CITRUS with minimum cluster 
sizes of 1-4%, cross validation folds 5-10 and false discovery rate 1-5% 
for 1x 104, 1.5 x10* and 2 x 10° events, totalling 17 individual runs. For 
SPADE conducted on flowcytometry data, CD3*CD8* cells were gated 
and clustered with a target number of nodes of 30. 


Immunohistochemistry 

Paraffin-embedded brain tissues were sectioned at 5-m thickness. 
Deparaffinization was achieved by washing slides through a series of 
xylenes and decreasing concentrations of ethanol. Tissue sections were 
then subjected to antigen retrieval using citrate buffer, pH 6.0 (Sigma- 
Aldrich) at 95 °C for 30 min. Following rinsing with PBS, sections were 
incubated in blocking buffer containing PBS with 10% normal donkey 
serum and 0.03% Triton-X (Sigma-Aldrich) for 2h at room temperature. 
Slides were then incubated with primary antibody in blocking buffer 
overnight at 4 °C. The following day, slides were rinsed with PBS then 
incubated in appropriate Alexa Fluor secondary antibodies (Thermo 
Fisher Scientific). Sections were then rinsed and stained with Hoe- 
chst DNA dye (Thermo Fisher Scientific) before being coverslipped 
with ProLong mounting medium (Invitrogen). Primary antibodies 
included rat anti-human CD3 (Abcam), rabbit anti-human CD8a (Cell 
Signaling), mouse anti-AB (Cell Signaling), chicken anti-human MAP2 
(Abcam), mouse anti-human granzyme-A (Abcam), rat anti-mouse 
CD8a (eBioscience) and rabbit anti-mouse NEFH (Abcam). For mouse 
AB plaque staining, ThioflavinS (1 mg mI“, 1:625, Sigma) was added to 
the secondary antibody solution. 


Confocal microscopy and image processing 

For human tissue, the LSM880 confocal laser scanning microscope 
(Zeiss) was used to acquire images using 40x and 63~ objectives. 
For mouse tissue imaging, the LSM700 and LSM710 confocal laser 
scanning microscopes (Zeiss) were used, which were provided by the 
microscopy core facility of the Spinal Cord Injury and Tissue Regen- 
eration Center Salzburg (SCI-TReCS). For all Z-stacks, images were 
acquired using optical sectioning then combined into maximum inten- 
sity projections. 3D reconstructions were performed using Imaris v.9.1.2 
(Bitplane). 


Quantitative histology 

For quantification of human CD3*CD8* and GZMA‘CD8‘ T cells, homolo- 
gous sections of the hippocampal dentate gyrus were cut at 10-um 
thickness using a microtome (Leica Biosystems). Plot sampling was 
conducted by using CA3 as a boundary to draw a rectangular area of 
approximately 2 mm? that encompassed the molecular layer, granule 
celllayer and hilus. Three separate sections were sampled using a20x 
objective. CD3*CD8*' T cells were then manually counted by a blinded 
observer in ZEN 2 Blue Edition (Zeiss). 


Mice 

APP/PS1 mice” expressing a chimaeric mouse/human mutant amyloid 
precursor protein (Mo/HuAPP695swe) and a mutant human presenilin 
1(PS1-dE9) directed to CNS neurons under the prion protein promoter 
were used (TheJackson Laboratory). Mice were housed at the Paracelsus 
Medical University Salzburg in groups under standard conditions ata 
temperature of 22 °C and a12 hlight/dark cycle with ad libitum access to 
standard food and water. Over 10 mice aged 12-13 months comprising 
both sexes were randomly analysed in an unblinded manner. Animal 
care, handling, genotyping and experiments were approved by Para- 
celsus Medical University Salzburg ethical committees. 


Electron microscopy 

For ultrastructure analysis, 12-13-month-old APP/PS1 mice and wild- 
type controls were transcardially perfused with 4% PFA and 0.5% 
glutaraldehyde in 0.1 M phosphate buffer (PB). Brains were removed 
and 50-um sagittal sections were cut using a vibratome (Leica) and 


stored in 0.1M PB with 0.05% sodium azide. Sections with brain areas 
of interest were selected and pre-embedding 3,3’-diaminobenzidine 
(DAB)-immunostaining was performed as described previously*>”®. 
Rat anti-CD8 (eBioscience) primary antibody was used and incubated 
for 2 days at 4 °C. After several washes in PBS, sections were incubated 
for 2 days with biotinylated rabbit anti-rat antibody (Vector) at 4 °C. 
To increase staining signal, sections were incubated in the Vectastain 
ABC Hrp Kit (Vector) for 1h. After rinsing three times in PBS, sections 
were transferred to DAB using the Peroxidase Substrate Kit (Vector). 
Sections were rigorously washed and osmificated by incubating in 1% 
OsO4 (EMS) in 0.1M PB for Lh. Following 3 washes for 10 min in dis- 
tilled water, sections were incubated in Uranyl Acetate Replacement 
Stain-UAR (EMS) for 30 min. Sections were dehydrated by incubation 
in an increasing series of ethanol and transferred to propylenoxide 
(EMS) before embedding in araldite durcupan (four component resin, 
Sigma) overnight in aluminium foil cups at room temperature. The 
following day, slices were transferred to acetate slides (100 micron 
colour laser printer film, 5 Star Office Supplies), covered with small 
amounts of araldite, coated with a second acetate slide and hardened 
in an oven at 60 °C for 3 days. Hippocampal and cortical regions with 
CD8* T cells were identified using light microscopy and regions of 
interest were cut out of embedded sections. One side of the acetate 
slide was removed and samples were glued onto an araldite durcu- 
pan block with cyanoacrylate adhesive (UHU). This protocol was per- 
formed with small modifications according to previously reported 
protocols””®. After removing the acetate slides and careful trimming of 
the samples into atrapezial shape, semi-thin sections of 1-1.5 1pm were 
generated using an Ultracut Reichert E (Leica). Semi-thin sections were 
transferred to object slides and were stained with 1% Toluidine blue for 
orientation. After identification of stained cells, ultra-thin sections 
of 70 nm were directly cut using a diamond knife (Diatome). Sections 
were stretched with chloroform and subsequently collected with 50 
or 75 mesh copper grids of 3 mm diameter coated with 0.2% Formvar 
solution. Grids were dried and analysed with a LEO 912 transmission 
electron microscope (Zeiss) equipped with an in-column energy filter, 
at 100 kV by using a Slow Scan Dual Speed CCD camera TRS Sharpeye 
(Troendle). The imaging software used was Image SP v.1.2.7.31 (SIS, 
Soft Image System). 


TCR amplification by nested PCR plate-seq 

TCR sequencing was conducted according to our previously established 
protocols>*”’. In brief, TCR sequences from live CD3* single cells were 
obtained by a series of three nested PCR reactions. For all phases of 
PCR reactions, HotStarTaq DNA polymerase (Qiagen) was used. The 
phase 1 PCR reaction was a multiplexed PCR with multiple Va, VB, Ca 
and Cf region primers in a 16-pl reaction. For the phase 1 PCR reac- 
tion, the final concentration of each TCR V-region primer was 0.06 
UM and each C-region primer was 0.3 pM. A PCR reaction was done 
using the following conditions: 95 °C 15 min; 94 °C 30 s, 62 °C 1 min, 
72°C 1min x 16 cycles; 72 °C 10 min; 4 °C. Thereafter, a 1-pl aliquot of 
the phase 1 product was used as a template for the 12-11 phase 2 PCR 
reaction. The following cycling conditions were used for phase 2 PCR: 
95 °C 15 min; 94 °C 30 s, 64 °C 1 min, 72 °C 1 min x 25 cycles; 72°C 5 
min; 4 °C. For the phase 2 reaction, multiple internally nested TCRVa, 
TCRVB, TCRCa and Cf primers were used (V primers 0.6 uM, C primers 
0.3 uM). The phase 2 primers of TCR V-region contained a common 
23-base sequence at the 5’ end to enable amplification during the phase 
3 reaction with a common 23-base primer. A 1-1 aliquot of the phase 
2 PCR product was used as a template for the 14-pl phase 3 PCR reac- 
tion, which incorporated barcodes and enabled sequencing on the 
Illumina MiSeq platform. For the phase 3 PCR reaction, amplification 
was performed using a 5’ barcoding primer (0.05 pM) containing the 
common 23-base sequence and a3’ barcoding primer (0.05 LM) con- 
taining the sequence of a third internally nested Ca and/or Cf primer, 
and Illumina Paired-End primers (0.5 pM each). The following cycling 


conditions were used for phase 3 PCR: 95 °C 15 min; 94 °C 30 s, 66 °C 
30s, 72 °C 1 min x 25 cycles; 72 °C 5 min; 4 °C. The final phase 3 bar- 
coding PCR reactions for TCRa and TCRP were done separately. For 
the phase 3 reaction, 0.5 pM of the 3’ Ca barcoding primer and the 3’ 
CB barcoding primer were used. In addition to the common 23-base 
sequence at the 3’ end (that enables amplification of products from 
the second reaction) and a common 23-base sequence at the 5’ end 
(that enables amplification with Illumina Paired-End primers), each 
5’ barcoding primer contains a unique 5-base barcode that specifies 
plate and a unique 5-base barcode that specifies row within the plate. 
These 5’ barcoding primers were added witha multichannel pipette to 
each of 12 wells within a row within a plate. In addition to the internally 
nested TCR C-region sequence and a common 23-base sequence at the 
3’ end (that enables amplification with Illumina Paired-End primers), 
each 3’ barcoding primer contained a unique 5-nucleotide barcode 
that specified column. These 3’ barcoding primers were added witha 
multichannel pipette to each of eight wells within a column within all 
plates. After the phase 3 PCR reaction, each PCR product should have 
had a unique set of barcodes incorporated that specified plate, row and 
column and had Illumina Paired-End sequences that enabled sequenc- 
ing onthe Illumina MiSeq platform. The PCR products were combined 
at equal proportion by volume and run ona1.2% agarose gel, anda band 
around 350 to 380 bp was excised and gel purified using a Qiaquick gel 
extraction kit (Qiagen). This purified product was then sequenced. 


TCR plate-seq analysis 

TCR sequencing data were analysed as previously described*”’. Raw 
sequencing data were processed and demultiplexed using a custom 
software pipeline to separate reads from every well in every plate as 
per specified barcodes. All paired ends were assembled by finding a 
consensus of at least 100 bases in the middle of the read. The resulting 
paired-end reads were then assigned to wells according to barcode. 
Primer dimers were filtered out by establishing a minimum length 
of 100 bases for each amplicon. A consensus sequence was obtained 
for each TCR gene. Because multiple TCR genes might be present in 
each well, our software establishes a cut-off of >95% sequence identity 
within a given well. All sequences exceeding 95% sequence identity are 
assumed to derive from the same TCR gene and a consensus sequence is 
determined. The 95% cut-off conservatively ensures that all sequences 
derived from the same transcript are properly assigned. 


Drop-seq of peripheral CD8* T,-,,, and CSF cells 

For CD8* Typ, SCRNA-Seq, base call files from a HiSeq 4000 Sequencer 
(Illumina) were generated by the Stanford Genomics Core. For CSF 
and peripheral CD8* Tip, SCRNA-Seq, FASTQ files from a Novoseq S4 
sequencer were generated by Novogene. For CSF scTCR-seq, base call 
files froma NextSeq550 Sequencer (Illumina) were generated in-house. 
Single cell V(D)J and 5’ gene-expression analysis (1OX Genomics) was 
used for peripheral CD8* Typ, and CSF cells. Cell Ranger v.3.0.2 was 
used to generate gene-expression matrices for both peripheral Tera 
and CSF cells. The Cell Ranger mkfastq pipeline generated FASTQ 
files for both the 5S’ expression library and the V(D)J libraries. Reads 
from the 10X v.25’ paired library were mapped to the human genome 
build GRCh38 3.0.0. Reads from the 10X V(D)J kit were mapped to the 
vdj-GRCh38 alts ensemble 2.0.0 (available from 10X Genomics). The 
5’ gene-expression libraries were then analysed with the Cell Ranger 
count pipeline and the resulting expression matrix was used for further 
analysis in the Seurat package v.3.0. The V(D)J FASTQ files were analysed 
with the Cell Ranger vdj pipeline, which produced single cell V(D)J 
sequences and determined clonotypes. Clonotypes were determined by 
grouping of cell barcodes that shared the same set of productive CDR3 
nucleotide sequences. The sequences of all contigs from all cells within 
a clonotype were then assembled to produce a clonotype consensus 
sequence. Clonality was integrated into the Seurat gene-expression 
analysis by adding clonality information to the metadata. 


Clustering of peripheral CD8° T,,,,, and CSF cells 

Individual sample expression matrices were loaded into R using the 
function Read10x under the Matrix package v.1.2-15. The expression 
matrix for each sample was merged into one Seurat object using the 
CreateSeuratObject and MergeSeurat functions. The Seurat package 
v.3.0°°! was used for filtering, variable gene selection, normalization, 
scaling, dimensionality reduction, clustering and visualization. For 
CD8* Tera Cells, genes were excluded if they were expressed in fewer 
than 10 cells, and cells were excluded if they expressed fewer than 200 
genes. Cells that expressed more than 2,500 genes, more than10,000 
unique molecular identifiers (UMIs) and more than 10% mitochondrial 
genes were excluded. Regularized negative binomial regression using 
the sctransform normalization method was used to normalize, scale, 
select variable genes and regress out experimental batch, mitochon- 
drial mapping percentage and the number of UMIs. After filtering and 
normalization, there were 15,330 genes across 18,203 cells. Following 
principal component analysis (PCA), 10 principle components were 
selected for clustering the cells. A resolution of 0.4 was used with t-SNE 
visualization. For CSF cells, genes were excluded if they were expressed 
in fewer than 10 cells and cells were excluded if they expressed fewer 
than 200 genes. Cells that expressed more than1,800 genes, more than 
7,500 UMIs and more than 10% mitochondrial genes were excluded 
from the analysis. The sctransform normalization method was used 
to normalize, scale, select variable genes and regress out sequencing 
and experimental batch, mitochondrial mapping percentage and the 
number of UMIs. After filtering and normalization, there were 212,267 
samples and 14,734 features. Following PCA, four principle components 
were selected for clustering. A resolution of 0.3 was used with t-SNE 
visualization. 


Analysis of sCRNA-seq data 

For differential expression analysis, markers for each cluster 
were determined by comparing the cells of each cluster to all other 
cells using the FindMarkers function in Seurat with the MAST 
algorithm from the R package MAST (v.1.8.2). For all comparisons 
between groups and clusters, only genes expressed by at least 10% 
of cells were included. The R package ggplot2 (v.3.1.0) was used 
to plot the results of differential expression analyses, showing the 
average log-transformed fold change of each gene and the -log10 
of the adjusted P value (Benjamini-Hochberg correction). Seurat was 
used to produce violin plots of the expression of selected genes. For 
pathway analysis, Panther was used to perform Reactome pathway 
analysis with genes that were identified from differential expression 
analysis (g < 0.05) and with all genes in the dataset as background. 
Fisher’s exact test was used with Bonferroni correction for multiple 
testing. Z-scores for each pathway were calculated using the R pack- 
age GOPlot. 


Determination of antigen specificity of V(D)J sequences 

To determine whether TCR sequences identified in our scTCR-seq 
experiments on peripheral T,y4g, and CSF cells had known antigen 
specificity, the CDR3b region of each B chain was compared to the 
CDR3b repertoire from the VDJdb at https://vdjdb.cdr3.net/. 


TCR clonality analysis 

For CSF cells sequenced by plate-seq, analysis of clonality was per- 
formed using Excel. TCRs with two or more identical CDR3b regions and 
CDR3a regions were defined as clonal. R was used to calculate a propor- 
tion of the sum of reads of the clonotypes to the overall number of reads 
inarepertoire (Dreads of top clonotypes)/(>) reads for all clonotypes). 
For TCRs sequenced using single-cell V(D)J technology (10X Genomics), 
clonality was determined by the Cell Ranger vdj pipeline as previously 
described. Clonotypes were determined from grouping of cell barcodes 
that shared the same set of productive CDR3 nucleotide sequences. 
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The sequences of all contigs from all cells within a clonotype were then 
assembled to produce a clonotype consensus sequence. Clonality was 
integrated into the Seurat gene-expression analysis by adding clonality 
information to the metadata. For TCR network analysis, to depict con- 
nections between diagnosis groups, patients and clonotypes, we used 
the qgraph package for R. Only TCRs with full a and § chain sequences 
were included in the analysis. Unweighted networks were generated 
with all subjects and split per diagnosis group. 


Gene-engineered T cell lines 

The two TCRs identified from patients with AD by GLIPH were cloned 
into a pHR vector using the signal peptides of the TCR variable genes 
and full-length, unmodified constant regions from IMGT/V-QUEST. 
Packaging vectors pMD2G and pPAX were used to allow for human 
mammalian tropism and viral generation. Lentivirus was generated 
for each TCRa and TCR separately in Lenti-X 293T cells (Takara). Cells 
were cultured with 10% fetal bovine serum (FBS) in Dulbecco’s modified 
Eagle’s medium (DMEM) and viruses were collected at 48 h, filtered 
with 0.45-mM syringe filters (Thermo Fisher Scientific), and frozen 
at -80 °C or used immediately to infect the TCR-deficient SKW-3 cell 
line (Creative Bioarray; these cells were verified to lack TCRs using 
an antibody against TCRaf and confirmed negative for mycoplasma 
contamination). TCRaB 1 expressed TCRa: CAASEGGFKTIF and TCRB: 
CASSLGTGNNEOFF. TCRaf 2 expressed TCRa: CAADRTGGGNKLTF and 
TCRB: CASSLAGGYNEOQFF. Combined TCRa and TCR viruses were 
used to infect 2 x 10° SKW-3 cells at a multiplicity of infection of five 
by spinning for 1h at 2,500 rpm at 32 °C, and cells were then placed 
back in culture. After 48 h, cells were collected, rinsed with PBS then 
incubated with live/dead near infrared dead cell dye (Thermo Fisher 
Scientific), stained for TCRaf (IP26, BioLegend) and sorted ona FAC- 
SAria II cell sorter (BD). TCRaB* cells were sorted before each experi- 
ment and cultures were maintained in DMEM containing 10% FBS and 
1x penicillin-streptomycin supplemented with recombinant human 
IL-2 (20 IU mI; R&D Systems). 


TCR stimulation 

For aCD2/CD3/CD28 TCR stimulation, anti-biotin particles loaded 
with the biotinylated antibodies were used to mimic antigen-present- 
ing cells and activate TCRs. Transduced SKW-3 cells were incubated 
with biotinylated antibodies against human CD2, CD3 and CD28 
(Miltenyi) in DMEM containing 10% FBS and 1x penicillin-streptomycin. 
Data were analysed using Cytobank and samples were gated on SKW-3 
cells by forward and side scatter before analysis for TCRaB and CD69 
expression. 


Autologous fibroblast antigen presentation 

T cell lines were co-cultured with autologous fibroblasts in a 2:1 ratio 
during antigen presentation. For MCH-I peptide pool stimulation, a 
mix of 80 known MHC-lantigenic peptides (CEFX Ultra SuperStim Pool 
MHC-I Subset; JPT Peptides) were used to screen for reactive TCRs. 
Fibroblasts were plated at 5 x 10* cells per well in 24-well plates. The 
peptide mix contained 0.3-1.2 pg per peptide per pl (dependent on 
each peptide’s molecular weight) dissolved in DMSO and hada purity of 
each peptide of greater than 70% as determined by high-performance 
liquid chromatography (HPLC). Peptide pools were added to fibroblast 
plates at 1:100 dilution in 250 pl medium. Control wells contained DMSO 
only. For individual peptide stimulations, peptides were produced at 
a purity of greater than 90% as determined by HPLC (JPT Peptides). 
Following 18 h of incubation at 37 °C, fibroblasts were heat-shocked 
at 41°C for 30 min to increase antigen presentation. Fibroblasts were 
then rinsed with PBS and 1x 10° cells from each T cell line were addedin 
medium containing purified no azide/low endotoxin mouse anti-human 
CD28 antibody (2 pg mI™; BD Biosciences) and IL-2 (20 IU mI”). After 
18 h, cells were analysed for activation by flow cytometry for TCRaB 
and CD69 expression. 


Statistical methods 


All statistical analyses were performed using commercially 
available software (Prism, SPSS or Excel). All values are expressed 
as the mean + SEM. Differences in means between two groups 
were analysed using unpaired two-sided heteroscedastic t-tests with 
Welch’s correction, unless otherwise noted. For regression analyses, 
the significance of the difference between two datasets was meas- 
ured by ANCOVA. Differences in means among multiple datasets were 
analysed using one- or two-way analysis of variance (ANOVA). When 
ANOVA showed significant differences, pair-wise comparisons between 
means were tested by Sidak’s or Tukey’s multiple comparisons test. For 
scRNA-seq analyses, we corrected for multiple comparisons and report 
adjusted P values using Benjamini-Hochberg correction. For pathway 
analyses, Fisher’s exact test was used with Bonferroni correction for 
multiple testing. No statistical methods were used to predetermine 
sample size. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


scRNA-seq and scTCR-seq datasets have been deposited online in the 
Gene Expression Omnibus (GEO) under accession number GSE134578. 
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Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[ ] For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection Brain MRI quantifications were conducted using FreeSurfer 6.0. 
Confocal images were acquired using ZEN 2 Blue Edition (Zeiss). 
Electron microscopy images were acquired with Imaging software was Image SP v1.2.7.31 (SIS, Soft Image System). 
Clonality measurements were calculated using R Studio. 
Mass cytomtery analysis was conducted using Cytobank. 
For TCR network analysis, to depict connections between diagnosis groups, patient IDs and clonotypes, we used the qgraph package for R 
(Epskamp et al. 2012). Only TCRs with alpha and beta chain sequenced were included in the analysis and non-clonal cells were filtered 
out. Unweighted networks were generated with all subjects and split per diagnosis group. 
Sacha Epskamp, Angelique O. J. Cramer, Lourens J. Waldorp, Verena D. Schmittmann, Denny Borsboom (2012). qgraph: Network 


Visualizations of Relationships in Psychometric Data. Journal of Statistical Software, 48(4), 1-18. URL http://www. jstatsoft.org/v48/i04/. 


Data analysis Normalization of mass cytometry data was achieved using the Matlab bead normalization tool. 
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Data analysis 
3D reconstructions of mouse CD8 T cells were performed using Imaris v9.1.2 (Bitplane). 


For volumetric measurements in Extended Data Figure 1: for subcortical segmentation, we used the "recon-all" command. For 
hippocampal segmentation, we appended the flag “-hippocampal-subfields-T1” to the "recon-all" command for each patient. To correct 
for sex differences, we normalized all volumetric measurements to total intracranial volume for each patient. 


TCR clonality analysis of CSF cells by plate-seq, we used Excel. TCRs with two or more identical CDR3b regions and CDR3a regions were 
defined as clonal. R was used to calculate a proportion of clonotypes’ sum of reads to the overall number of reads in a repertoire (5 reads 
of clonotype)/(5 reads for all clonotypes). For TCRs sequenced using Single cell V(D)J technology (10X Genomics), clonality was 
determined by the cellranger vdj pipeline as previously described. Clonotypes were determined from grouping of cell barcodes that 
shared the same set of productive CDR3 nucleotide sequences. The sequences of all contigs from all cells within a clonotype were then 
assembled to produce a clonotype consensus sequence. Clonality was integrated into the Seurat gene expression analysis by adding 
clonality information to the metadata. For TCR network analysis, to depict connections between diagnosis groups, patients and 
clonotypes, we used the qgraph package for R. Only TCRs with full a and B chain sequences were included in the analysis. Unweighted 
networks were generated with all subjects and split per diagnosis group. 


Analysis of scRNAseq data 

Differential Expression 

Markers for each cluster were determined by comparing the cells of each cluster to all other cells using the FindMarkers function in 
Seurat with the Model-based Analysis of Single Cell Transcriptomics (MAST) algorithm from the R package ‘MAST’ version 1.8.2. For all 
comparisons between groups and clusters, only genes expressed by at least 10% of cells were included. The R package ‘ggplot2’ version 
3.1.0 was used to plot the results of the differential expression analysis, showing the average log fold change of each gene on the x axis 
and the -log10 of the p value adjusted (Benjamini-Hochberg correction). Seurat was used to produce violin plots of the expression of 
select genes. 
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Pathway analysis 

Panther was used to perform Reactome pathway analysis with genes identified from differential expression analysis (q<0.05) and with all 
genes in the dataset as background. Fisher’s Exact test was with the Bonferroni correction for multiple testing. Z-scores for each pathway 
were calculated using the R package ‘GOPlot’. 


Determination of Antigen Specificity of V(D)J Sequences 
To determine whether TCR sequences identified in our scTCRseq experiments on peripheral TEMRA and CSF cells had known antigen 
specificity, the CDR3b region of each beta chain was compared to the CDR3b repertoire from the VDJdb at https://vdjdb.cdr3.net/. 


Clustering of peripheral CD8+ TEMRA and CSF cells 

Individual sample expression matrices were loaded into R using the function Read10x under the ‘Matrix’ package v1.2-15. The expression 
matrix for each sample was merged into one Seurat object using the CreateSeuratObject and MergeSeurat functions. Seurat package 
v3.043,44 was utilized for filtering, variable gene selection, normalization, scaling, dimensionality reduction, clustering and visualization. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Source data for figures are provided within Extended Data Figures. 


RNA-seq datasets have been deposited online in the Gene Expression Omnibus (GEO) under accession number GSE134578. 


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Our mass cytometry study measured two independent study groups (healthy vs. MCI/AD) with continuous primary endpoints. We used power 
analyses to determine the minimum number of study subjects required per group, which was calculated to be 42 healthy and 14 MCI/AD. We 


exceded these numbers in the study by analyzing 57 healthy and 23 MCI/AD samples. We established anticipated means for our healthy 
cohort based on preliminary experiments and considered an enrollment ratio of 3:1 (healthy: MCI/AD) based on enrollment data from the 
clinic. We measured group sizes to account for a 20% increase in our MCI/AD group. Our type Il error rate was set to 0.05 and our power level 
set to 90%. Following the study, we performed a post-hoc power analysis with our sample sizes and found our mass cytometry study to have 
92.3% power. 


Data exclusions Subjects who progressed from a clinical diagnosis of mild cognitive impairment to any form of dementia besides Alzheimer's were removed 
from the mass cytometry analysis. 


Replication All experiments in this study were successfully replicated at least twice. 


Randomization — Study participants were allocated into groups following data acquisition. Study subjects underwent a battery of neuropsychological 
assessments to determine group status, including: cognitive examination, evaluation of cerebellar function, deep tendon reflexes, sensory 
input, and motor function. The Montreal Cognitive Assessment (MoCA) examination was used to test study subjects for cognitive impairment. 
The MoCA assesses several cognitive domains: short-term memory recall (5 points), visuospatial abilities (4), executive functions (4), attention 
(1), concentration (3), working memory (1), language (6) and orientation to time and space (6). MoCA scores range between 0 and 30. A score 
of 26 or over is considered to be cognitively typical. 
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Blinding All experimenters were blinded to group allocation during data acquisition and analysis. 


Reporting for specific materials, systems and methods 


aterials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Unique biological materials 


Policy information about availability of materials 


Obtaining unique materials | Fresh human brain samples are difficult to obtain under a short post-mortem window and are thus not readily available. CSF 
samples contain very few cells and can only be analyzed once and are thus not readily available. 


Antibodies 


Antibodies used Flow cytometry was conducted using an LSRFortessa (BD Biosciences). A panel consisting of antibodies conjugated to six 
different fluorophores was used to classify subsets of memory T cells and for droplet-based sequencing. Antibodies used were: 
CD8a-Pacific blue (Clone SK1; BioLegend cat #344718), CD3-BV650 (Clone UCHT1; BD Biosciences cat #563851), CD45RA-APC- 
Cy7 (Clone HI100; BioLegend cat #304128), CCR7-488 (Clone 4B12; BioLegend cat #120110), IL-7Ra-PE (Clone AO19D5; 
BioLegend cat #351304), CD27-PE-Cy7 (Clone M-T271; BioLegend #356412). For CSF cell characterization, this same panel was 
used, but CD19-PE-Cy5 (Clone HIB19 BioLegend cat #302210) and CD14-Qdot-705 (Clone TUK4; ThermoFisher cat #Q22137) 
were included. For sorting CSF T cells for TCR plate sequencing, the following antibodies were used: CD8a-PE (Clone SK1; 
BioLegend cat #344706), CD161-PE-Cy7 (Clone HP-3G10; BioLegend #339917), CXCR3-APC (Clone CXCR3-173; BioLegend cat 
#126512), CD4-APC-Alexa700 (Clone RM4-5; ThermoFisher cat #56-0042-82), CD39-APC-Cy7 (Clone A1; BioLegend cat #328226), 
CD38-FITC (Clone HIT2; BioLegend cat #303504), PD-1-BV421 (Clone EH12.1; BD Biosciences cat #562516), CD45RA-BV605 
(Clone 5H9; BD Biosciences cat #740424), CD3-BV650 (Clone UCHT1; BD Biosciences cat #563851), CD27-BV786 (Clone L128; BD 
Biosciences cat #563328), CD127-BUV395 (Clone HIL-7R-M21; BD Biosciences cat #742547). All antibodies were used at 1:250 
dilution. For each experiment, a compensation matrix was developed using singly stained and unstained controls or fluorescent 
beads, and all analysis was conducted in Cytobank. 


For immunohistochemistry, primary antibodies included polyclonal rat anti-human CD3 (1:200; Abcam cat #ab28364), rabbit 
anti-human CD8a (1:200; Clone D8A8Y; Cell Signaling), mouse anti-AB (1:2000; Clone D54D2; Cell Signaling), polyclonal chicken 
anti-human MAP2 (1:500; Abcam cat #ab5392), mouse anti-human granzyme-A (1:200; Clone GA6; Abcam), rat anti-mouse CD8a 
(Clone 4SM15; eBioscience cat #14-0808-82) and polyclonal rabbit anti-mouse NEFH (Abcam cat #ab8135). For mouse AB plaque 


staining, ThioflavinS (1 mg/mL, 1:625, Sigma) was added to the secondary antibody solution. 


Validation Each primary antibody was validated using proper fluorophore controls and positive signal was evaluated versus an unstained 
sample. All antibodies have been previously published by our group (Han et al., Nat. Biotech., 2014). All antibodies are from 
reputable vendors and come with validation statements on the manufacturer's website. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) SKW-3 cells were provided by Mark Davis' lab (Stanford). Lenti-X 293T cell line was acquired from Takara (cat #632180). 


Authentication Verification of TCR deficiency in SKW-3 cells line was conducted by flow cytometry against TCRab. 293T cells were not 
authenticated, but functionally produced lenti viruses. 


Mycoplasma contamination Negative for mycoplasma 
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Commonly misidentified lines SKW-3 cells are commonly misidentified as KE-37 cells. Our cells were verified to lack TCRs and functionally behave like T cells 
(See ICLAC register) when transduced with T cell receptors. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals APP/PS1 mice expressing a chimeric mouse/human mutant amyloid precursor protein (Mo/HuAPP695swe) and a mutant human 
presenilin 1 (PS1-dE9) directed to CNS neurons under the prion protein promoter were used (The Jackson Laboratory). Mice 
were housed at the Paracelsus Medical University Salzburg in groups under standard conditions at a temperature of 22°C anda 
12 h light/dark cycle with ad libitum access to standard food and water. Over 10 mice aged 9-10 months comprising both sexes 
were randomly analyzed in an unblinded manner. Animal care, handling, genotyping and experiments were approved by 
Paracelsus Medical University Salzburg ethical committees. 


Wild animals The study did not involve wild animals. 


Field-collected samples The study did not involve samples collected from the field. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics A total of 164 subjects from four separate cohorts were used in this study. The 164 subjects used in this study included 97 
healthy, 31 MCI, 28 AD and 8 PD. The average age for healthy subjects was 72.52 + 6.96 (meantstandard deviation), for MCI 
subjects: 70.97 + 7.82, for AD subjects: 70.74 + 7.01 and for PD subjects: 67.25 + 7.01. Average cognitive score for healthy 
subjects was 27.17 + 2.32, MCI: 23.39 + 4.31, AD: 11.05 + 7.32 and PD: 27.29 + 1.70. All subjects were free from acute infectious 
diseases and in good physical condition. Patients were confirmed to have neurodegeneration by measurement of biomarkers, 
including neurofilament, AB and tau (Quanterix). Group characteristics, including demographic, genetic, clinical and biomarker 
data for each group are presented in Extended Data Table 10. 


Recruitment Study subjects were recruited through Stanford's NIH-funded Alzheimer's Disease Research Center (ADRC), the UC San Diego 
ADRC and the UC San Francisco ADRC by general solicitation. The racial makeup of study subjects reflects that of the Stanford 
area and should be considered when analyzing these results. 


Flow Cytometry 
Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


ethodology 


Sample preparation PBMCs were isolated from blood by layering diluted blood (1:1 in PBS) on top of an equal volume of Ficoll, followed by 
centrifugation and isolation of the buffy coat. PBMC samples were stored over the course of a three-year period. Average PBMC 
viability for patient samples following sample thawing was 79% while the average viability for control samples was 77%. 


CSF was collected by lumbar puncture, then centrifuged at 300G to pellet immune cells. CSF samples were checked for blood 


contamination by resuspending the pelleted cells in 100 yl of CSF and mixing 10 pl (10%) CSF with 10 ul trypan blue to assess red 
blood cell content and viability. Cells were visualized on a TC20 automated cell counter (BioRad) and cell viability and presence/ 
absence of red blood cells was recorded. CSF samples contaminated with blood were removed from the study. The resuspended 
cells were then mixed with 900 ul Recovery Cell Culture Freezing Medium (Thermo Fisher). All Samples were frozen overnight at 
-80°C in a Mr. Frosty freezing container (Thermo Fisher) and transferred the following day to liquid nitrogen for storage. CSF 
samples were stored <6 months prior to analysis. 

Instrument BD LSRFortessa 

Software Cytobank v6.3.0.5 


Cell population abundance — 10,000 live leukocytes were gated for downstream SPADE analysis for mass cytomtery. 
For peripheral TEMRA scRNAseq, 50,000 peripheral cells were sorted then loaded into a 10X controller at 1,000 cells/ul. 
For CSF plate-seq, cells were sorted into individual wells of a 96-well plate. 


For CSF cell drop-seq, all cells from each CSF sample were live sorted, pelleted then loaded into the 10X controller. 
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Since CSF cells are in low abundance (approximately 1 cell per ul), processing of samples (i.e. repetitive centrifugation) was 
carefully limited to achieve greater numbers of cells. 


Gating strategy Cells were distinguished from debris by SSC/FSC, then signlets were selected on a FSC-H/FSC-W plot. Cells doubly positive for 
CD3 and CD8 were gated (~10%3 expression in each channel) 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type T1 weighted MRI Scans were acquired using Axial 3D fast spoiled gradient sequence (GE Discovery 750). 
Design specifications n/a (this is structural MRI data, not functional) 


Behavioral performance measures n/a 


Acquisition 
maging type(s) Structural 
Field strength 3 Tesla 
Sequence & imaging parameters The imaging parameters were optimized for gray/white matter tissue contrast with a repetition time of 5.9ms, echo 
time 2ms, flip angle 15, field-of-view 220mm, matrix size 256 x 256, slice thickness Imm and 2 NEX. 
Area of acquisition Whole brain 
Diffusion MRI Used Not used 


Preprocessing 


Preprocessing software All the suggested T1-weighted scans use GE's "BRAVO" sequence. It is an IR-prep, fast SPGR sequence with parameters 
tuned to optimize brain tissue contrast. 


Normalization To correct for sex differences, we normalized all volumetric measurements to total intracranial volume for each patient. 


Normalization template MNI305 atlas was used for normalization. First stage was affine registration, followed by initial volumetric labeling. The 
variations due to biasing were corrected. Finally, a high dimensional nonlinear volumetric alignment to the MNI305 atlas 
was performed, and once preprocessing completed, the volume was labeled. 

Noise and artifact removal n/a These are structural data, not fMRI data 


Volume censoring n/a These are structural data, not fMRI data 


Statistical modeling & inference 


Model type and settings Comparisons between groups of subjects were performed using multivariate analysis of covariance, where age served 
as a covariate. 


Effect(s) tested Increase/decrease in brain region volume 


Specify type of analysis: Whole brain ROI-based Both 


Subcortical segmentation including both left and right of Cerebral White Matter, Cerebral Cortex, Lateral 
Ventricle, Inferior Lateral Ventricle, Cerebellum White Matter, Cerebellum Cortex, Thalamus, Caudate, 
Putamen, Pallidum, Hippocampus, Amygdala, Lesion, Accumbens area, Vessel, Third Ventricle, Fourth 
Ventricle, Brain Stem, Cerebrospinal Fluid 


Anatomical location(s) 


Statistic type for inference n/a 
(See Eklund et al. 2016) 


Correction n/a 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


Graph analysis 


Multivariate modeling or predictive analysis 
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Extended Data Fig. 1| Clinical measures of subjects from mass cytometry 
experiments. a, Cohort 1 groups were age-matched and included 57 healthy 
subjects and 23 patients with MCI or AD. b, Cognitive scoring shows 
significantly decreased cognitive scores in patients with MCI or AD. Note that 
notall patients with MCI or AD could complete a cognitive exam and these 
patients are thus not included in this analysis. Unpaired two-sided ¢-test with 
Welch's correction (n=51 healthy; n=12 MCI or AD); mean+s.e.m.c,d, 
Quantification of CSF AB asa ratio to phosphorylated tau (c) or total tau (d) 
reveals significantly reduced ratios in patients with MCI or AD. Unpaired two- 
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sided t-test with Welch’s correction (n=18 healthy, n=7 MClor AD); 

mean +s.e.m.e, Top, representative MRI images show reduction in cortical 
grey matter in patients with AD. Middle, bottom, representative MRI images of 
several brain regions as measured by subcortical segmentation and 
hippocampal segmentation. Right, quantification of MRI images of patients 
with MClor AD compared to control individuals shows significant reductions 
in the percentage of the intracranial volume (ICV) of brain regions classically 
associated with AD pathology. CA1, cornu ammonis area 1. Unpaired two-sided 
t-test; mean+s.e.m. 
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Extended Data Fig. 2 | Gating strategy for identifying immune cell populations from mass cytometry data. A gating strategy was used to identify populations 
ofimmune cells by mass cytometry. 
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Extended Data Fig. 3 | Mass cytometry SPADE and CITRUS clustering shows 
increased numbers of CD8* Tyr, cells in MCI or AD. a, Normalization of mass 
cytometry data was performed before all analyses. A gating strategy for input 
into downstream analyses is shown. Live, single leukocytes were selected for 
analysis. b, Plotting of clusters by Pvalue and fold change of each cluster 
reveals cluster 63 as the most highly increased cluster among patients with MCI 
or AD. Clusters are sized according to their percentage of total PBMCs. 
Unpaired two-sided t-test (n=57 healthy; n=23 MCl or AD).c, CITRUS 
clustering showing significant differentiating populations (top left). Cluster 
229992 andi its significant daughter populations are outlined. Expression of 
CD3, CD8 and CD45RA shows that cluster 229992 corresponds to CD8* Tempra 
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cells. d, Quantification of cluster 229992 cells as a percentage of total PBMCs 
for individual subjects. Percentages of this cluster are significantly higher in 
patients with MCI or AD than healthy control individuals. Unpaired two-sided 
t-test with Welch’s correction; mean + s.e.m.e, Marker expression of cluster 
229992 shows it to bea CD3*CD8*CD45RA*CD27 Tey, population. f, The 
regularized supervised learning algorithm from CITRUS predicts disease 
group witha 20% error rate (80% positive predictability). The number of model 
features increases from left to right. The most predictive model is shownas the 
lowest cross validation error rate (red line) constrained by the false discovery 
rate (yellow triangle). 
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Extended Data Fig. 4| Altered CD8* T cell subsets in PBMCs from patients 
with MCl or AD. a, Mass cytometry plots of live PBMCs show increased 
abundance of CD3*CD8’ T cells in patients with MCI or AD. b, Gating of CD3* 
Tcells into CD4 and CD8 populations shows increased prevalence of CD8* 
Tcellsin patients with MCI or AD.c, d, Significantly increased effector 
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Extended Data Fig. 5| Correlations of memory T cell populations with 
cognitive scores. a, Gating strategy for measuring memory T cell populations. 
b, Linear regression analysis correlating CD8* T cell populations with cognitive 
scores indicates a positive relationship between T,,, and T;,, CD8* T cells and no 
relationship with naive cells. The significance of the difference between 
datasets was measured by ANCOVA.c, The relationship between the 
percentage of CD8* Tey, cells and cognitive score was not influenced by age. 
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The significance of the difference between the two datasets was measured by 
analysis of covariance (ANCOVA). Pearson’s correlationr values are shown for 
each group (b,c). d, Gating strategy for T cell stimulation experiments. 

e, Increased intracellular TNF cytokine response in PMA-ionomycin- 
stimulated CD8’" T cells from patients with MCI or AD. Unpaired two-sided f-test 
with Welch’s correction (n=10 healthy; n=14 MCl or AD); mean+s.e.m. 
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Extended Data Fig. 6 | Analysis of peripheral CD8* Typ, cells by scRNA-seq. 
a, Gating strategy for sorting peripheral CD8* Typ, cells for scRNA-seq. 

b, Differential expression analysis by scRNA-seq of CD8* Tyypa Cells from 
patients with MCI or AD versus CD8* Tyypa Cells from healthy individuals shows 
significantly increased expression of genes that are involved in T cell signalling, 
including /FITM3, NFKBIA and CDSB2. Violin plots show average log-normalized 
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individual patient (n=7 healthy; n= 6 MClor AD).c, Violin plots show average 
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Extended Data Fig. 7 | See next page for caption. 


Extended Data Fig. 7 | Histological analysis of CD8’ T cells in hippocampi 
from patients with AD and from APP/PSI1 mice. a, A blood vessel in the brain of 
acontrol (non-neurological disease) patient shows a lack of extravascular CD8* 
Tcells. Asterisk indicates the blood vessel lumen. b, ACD8° T cell within an AB 
plaque. Scale bar, 5 um.c, CD8*T cells in the AD-affected hippocampus in close 
proximity to AB plaques. White lines measure the distances from each cell to 
the nearest plaque centre. Dataina, bwere replicated in at least three 
independent experiments. d, Quantification of the average distance from CD8* 
Tcells to the nearest AB plaque. The dashed red line indicates the average of all 
cells. e, Representative images of the dentate gyrus that were used to quantify 


CD3*CD8° T cells in the hippocampi of control individuals and patients with AD. 
Inset shows two CD3*CD8'T cells. Sizes of the area plots used for quantification 
are shown for each image. Scale bar, 500 pm. f, Association of a CD8' T cell with 
NEFH* neuronal processes by immunohistochemistry and 3D modelling in the 
APP/PS1 mouse model of cerebral amyloidosis. g, Electron microscopy 
showing an association of a CD8* T cell with neuronal processes. Red 
arrowheads indicate areas in which the CD8' T cell associates with neuronal 
processes. Data in e-g were replicated in at least two independent 
experiments. 
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Extended Data Fig. 8 | Clonal expansion of T cells in the CSF. a, Gating 
strategy for enumerating T cell subtypes in CSF from healthy elderly 
individuals. b, Quantification of CSF cells shows that the majority of cells are 
Tcells, witha minority population of CD14* monocytes anda quantifiably 
minor number of CD19" B cells (n =10 healthy subjects); mean+s.e.m.c, 
Quantification of CD8* T cell subsets shows that the majority of cells are T,,, or 
Tempra Cells (2 =10 healthy subjects); mean +s.e.m.d, Single-cell sorting of CSF 
cells shows that CD4 and CD8T cells were sorted from control individuals and 


patients with AD. Data were replicated in two independent experiments. 

e, Donut plots depicting CSF clonality in plate-seq samples. Clones are 
coloured by their proportion of the total TCRaf sequences for each subject. 
f, Gating strategy for drop-seq experiments. Live CSF cells were sorted using a 
live/dead marker. g, Multidimensional reduction and visualization by t-SNE 
shows distribution of CSF cells by group, sex and patient (n=9 healthy, n=9 
MClor AD). 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | Clonally expanded CD8‘ T cells in CSF from patients 
with AD and patients with PD. a, The top individual clones in AD were assessed 
by combining scRNA-seq and scTCR-seq datasets. A MAST differential 
expression test with Benjamini-Hochberg correction was conducted to 
compare CD8* T cell maximum clones (n=12-18 cells) from patients with MCI 
or AD against all CSF T cells. Note the colocalization of AD clones with CD8* 
clusters onthe ¢-SNE plots. b, Separate analysis of patients with MCI, AD and PD 
by percentages of maximum clones revealed an enrichment of highly expanded 
clones (defined as comprising 3% or more of all TCRaB sequences; indicated by 
dotted line) in patients with these diseases. Only one out of ten healthy 
subjects hada highly expanded clone in their CSF, versus four out of six 
patients with AD, two out of six patients with PD and one out of five patients 
with MCI.c, Quantification of overall clonality (defined as the percentage of 


total TCRaB sequences that are identical to one or more TCRaf sequences) in 
the four groups of cohort 4. Significance was measured by two-way ANOVA 
followed by Tukey’s multiple comparisons test. Only samples with detectable 
clones were included in the analysis (n=11 healthy; n=5 MCI;n=5AD;n=6PD). 
Box plots in b, c Box plots show median and 25th to 75th percentiles, and 
whiskers indicate the minimum and maximum values. d, Gene-expression 
analysis was conducted on all 24 samples and clustered by ¢-SNE. Clusters were 
composed of a mixture of groups, patients, clonal and non-clonal cells. 

e, Genes (encoding cytotoxic effector proteins) that showed increased 
expression in amaximum PD clone (n= 14 cells) were analogous to those 
observed to be overexpressed in AD clones. MAST differential expression test 
with Benjamini-Hochberg correction. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | EBV BZLF1 antigen identification in the CSF of 
patients with AD. a, TCRB chains derived from GLIPH were used to clone two 
full TCRafB TCRs that were introduced into TCR-deficient SKW-3 cells by 
lentiviral transduction. TCRaB land TCRaf 2 cell lines expressed TCRaf by 
flow cytometry but controls (no virus and empty lenti viral vector) did not 
express TCRaf. Data were replicated in three independent experiments. 

b, Both TCRaf 1and TCRaf 2 cells upregulate the activation marker CD69 after 
stimulation with aCD2/CD3/CD28 beads (n=3 per group). One-way ANOVA 
(F(3,8) =204.02, P= 6.78 x 10 ®) with Tukey’s test for multiple comparisons; 
mean +s.e.m.c, Gating strategy for MHC-I peptide pool experiments. d, TCRaB 


land TCRaf 2 were presented with antigens in anon-autologous fashion 
(mismatch between fibroblast and TCR). No significant differences in 
reactivity were detected in either cell line. One-way ANOVA (F(3,8) =1.16, 
P=0.38) with Tukey’s test for multiple comparisons; mean +s.e.m.e, Individual 
histograms of autologous candidate peptide stimulations. CD69 expression is 
shown for control DMSO and each peptide for both cell lines. Note the 
increased expression of CD69 induced by peptide 7 in TCRaf 1 cells. Data were 
replicated in three independent experiments. f, Gating strategy for quantifying 
HLA-B*08:01 EBV BZLF1 dextramer positivity. 
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Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[ ] For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection Brain MRI quantifications were conducted using FreeSurfer 6.0. 
Confocal images were acquired using ZEN 2 Blue Edition (Zeiss). 
Electron microscopy images were acquired with Imaging software was Image SP v1.2.7.31 (SIS, Soft Image System). 
Clonality measurements were calculated using R Studio. 
Mass cytomtery analysis was conducted using Cytobank. 
For TCR network analysis, to depict connections between diagnosis groups, patient IDs and clonotypes, we used the qgraph package for R 
(Epskamp et al. 2012). Only TCRs with alpha and beta chain sequenced were included in the analysis and non-clonal cells were filtered 
out. Unweighted networks were generated with all subjects and split per diagnosis group. 
Sacha Epskamp, Angelique O. J. Cramer, Lourens J. Waldorp, Verena D. Schmittmann, Denny Borsboom (2012). qgraph: Network 


Visualizations of Relationships in Psychometric Data. Journal of Statistical Software, 48(4), 1-18. URL http://www. jstatsoft.org/v48/i04/. 


Data analysis Normalization of mass cytometry data was achieved using the Matlab bead normalization tool. 
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Data analysis 
3D reconstructions of mouse CD8 T cells were performed using Imaris v9.1.2 (Bitplane). 


For volumetric measurements in Extended Data Figure 1: for subcortical segmentation, we used the "recon-all" command. For 
hippocampal segmentation, we appended the flag “-hippocampal-subfields-T1” to the "recon-all" command for each patient. To correct 
for sex differences, we normalized all volumetric measurements to total intracranial volume for each patient. 


TCR clonality analysis of CSF cells by plate-seq, we used Excel. TCRs with two or more identical CDR3b regions and CDR3a regions were 
defined as clonal. R was used to calculate a proportion of clonotypes’ sum of reads to the overall number of reads in a repertoire (5 reads 
of clonotype)/(5 reads for all clonotypes). For TCRs sequenced using Single cell V(D)J technology (10X Genomics), clonality was 
determined by the cellranger vdj pipeline as previously described. Clonotypes were determined from grouping of cell barcodes that 
shared the same set of productive CDR3 nucleotide sequences. The sequences of all contigs from all cells within a clonotype were then 
assembled to produce a clonotype consensus sequence. Clonality was integrated into the Seurat gene expression analysis by adding 
clonality information to the metadata. For TCR network analysis, to depict connections between diagnosis groups, patients and 
clonotypes, we used the qgraph package for R. Only TCRs with full a and B chain sequences were included in the analysis. Unweighted 
networks were generated with all subjects and split per diagnosis group. 


Analysis of scRNAseq data 

Differential Expression 

Markers for each cluster were determined by comparing the cells of each cluster to all other cells using the FindMarkers function in 
Seurat with the Model-based Analysis of Single Cell Transcriptomics (MAST) algorithm from the R package ‘MAST’ version 1.8.2. For all 
comparisons between groups and clusters, only genes expressed by at least 10% of cells were included. The R package ‘ggplot2’ version 
3.1.0 was used to plot the results of the differential expression analysis, showing the average log fold change of each gene on the x axis 
and the -log10 of the p value adjusted (Benjamini-Hochberg correction). Seurat was used to produce violin plots of the expression of 
select genes. 
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Pathway analysis 

Panther was used to perform Reactome pathway analysis with genes identified from differential expression analysis (q<0.05) and with all 
genes in the dataset as background. Fisher’s Exact test was with the Bonferroni correction for multiple testing. Z-scores for each pathway 
were calculated using the R package ‘GOPlot’. 


Determination of Antigen Specificity of V(D)J Sequences 
To determine whether TCR sequences identified in our scTCRseq experiments on peripheral TEMRA and CSF cells had known antigen 
specificity, the CDR3b region of each beta chain was compared to the CDR3b repertoire from the VDJdb at https://vdjdb.cdr3.net/. 


Clustering of peripheral CD8+ TEMRA and CSF cells 

Individual sample expression matrices were loaded into R using the function Read10x under the ‘Matrix’ package v1.2-15. The expression 
matrix for each sample was merged into one Seurat object using the CreateSeuratObject and MergeSeurat functions. Seurat package 
v3.043,44 was utilized for filtering, variable gene selection, normalization, scaling, dimensionality reduction, clustering and visualization. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Source data for figures are provided within Extended Data Figures. 


RNA-seq datasets have been deposited online in the Gene Expression Omnibus (GEO) under accession number GSE134578. 


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Our mass cytometry study measured two independent study groups (healthy vs. MCI/AD) with continuous primary endpoints. We used power 
analyses to determine the minimum number of study subjects required per group, which was calculated to be 42 healthy and 14 MCI/AD. We 


exceded these numbers in the study by analyzing 57 healthy and 23 MCI/AD samples. We established anticipated means for our healthy 
cohort based on preliminary experiments and considered an enrollment ratio of 3:1 (healthy: MCI/AD) based on enrollment data from the 
clinic. We measured group sizes to account for a 20% increase in our MCI/AD group. Our type Il error rate was set to 0.05 and our power level 
set to 90%. Following the study, we performed a post-hoc power analysis with our sample sizes and found our mass cytometry study to have 
92.3% power. 


Data exclusions Subjects who progressed from a clinical diagnosis of mild cognitive impairment to any form of dementia besides Alzheimer's were removed 
from the mass cytometry analysis. 


Replication All experiments in this study were successfully replicated at least twice. 


Randomization — Study participants were allocated into groups following data acquisition. Study subjects underwent a battery of neuropsychological 
assessments to determine group status, including: cognitive examination, evaluation of cerebellar function, deep tendon reflexes, sensory 
input, and motor function. The Montreal Cognitive Assessment (MoCA) examination was used to test study subjects for cognitive impairment. 
The MoCA assesses several cognitive domains: short-term memory recall (5 points), visuospatial abilities (4), executive functions (4), attention 
(1), concentration (3), working memory (1), language (6) and orientation to time and space (6). MoCA scores range between 0 and 30. A score 
of 26 or over is considered to be cognitively typical. 
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Blinding All experimenters were blinded to group allocation during data acquisition and analysis. 


Reporting for specific materials, systems and methods 


aterials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Unique biological materials 


Policy information about availability of materials 


Obtaining unique materials | Fresh human brain samples are difficult to obtain under a short post-mortem window and are thus not readily available. CSF 
samples contain very few cells and can only be analyzed once and are thus not readily available. 


Antibodies 


Antibodies used Flow cytometry was conducted using an LSRFortessa (BD Biosciences). A panel consisting of antibodies conjugated to six 
different fluorophores was used to classify subsets of memory T cells and for droplet-based sequencing. Antibodies used were: 
CD8a-Pacific blue (Clone SK1; BioLegend cat #344718), CD3-BV650 (Clone UCHT1; BD Biosciences cat #563851), CD45RA-APC- 
Cy7 (Clone HI100; BioLegend cat #304128), CCR7-488 (Clone 4B12; BioLegend cat #120110), IL-7Ra-PE (Clone AO19D5; 
BioLegend cat #351304), CD27-PE-Cy7 (Clone M-T271; BioLegend #356412). For CSF cell characterization, this same panel was 
used, but CD19-PE-Cy5 (Clone HIB19 BioLegend cat #302210) and CD14-Qdot-705 (Clone TUK4; ThermoFisher cat #Q22137) 
were included. For sorting CSF T cells for TCR plate sequencing, the following antibodies were used: CD8a-PE (Clone SK1; 
BioLegend cat #344706), CD161-PE-Cy7 (Clone HP-3G10; BioLegend #339917), CXCR3-APC (Clone CXCR3-173; BioLegend cat 
#126512), CD4-APC-Alexa700 (Clone RM4-5; ThermoFisher cat #56-0042-82), CD39-APC-Cy7 (Clone A1; BioLegend cat #328226), 
CD38-FITC (Clone HIT2; BioLegend cat #303504), PD-1-BV421 (Clone EH12.1; BD Biosciences cat #562516), CD45RA-BV605 
(Clone 5H9; BD Biosciences cat #740424), CD3-BV650 (Clone UCHT1; BD Biosciences cat #563851), CD27-BV786 (Clone L128; BD 
Biosciences cat #563328), CD127-BUV395 (Clone HIL-7R-M21; BD Biosciences cat #742547). All antibodies were used at 1:250 
dilution. For each experiment, a compensation matrix was developed using singly stained and unstained controls or fluorescent 
beads, and all analysis was conducted in Cytobank. 


For immunohistochemistry, primary antibodies included polyclonal rat anti-human CD3 (1:200; Abcam cat #ab28364), rabbit 
anti-human CD8a (1:200; Clone D8A8Y; Cell Signaling), mouse anti-AB (1:2000; Clone D54D2; Cell Signaling), polyclonal chicken 
anti-human MAP2 (1:500; Abcam cat #ab5392), mouse anti-human granzyme-A (1:200; Clone GA6; Abcam), rat anti-mouse CD8a 
(Clone 4SM15; eBioscience cat #14-0808-82) and polyclonal rabbit anti-mouse NEFH (Abcam cat #ab8135). For mouse AB plaque 


staining, ThioflavinS (1 mg/mL, 1:625, Sigma) was added to the secondary antibody solution. 


Validation Each primary antibody was validated using proper fluorophore controls and positive signal was evaluated versus an unstained 
sample. All antibodies have been previously published by our group (Han et al., Nat. Biotech., 2014). All antibodies are from 
reputable vendors and come with validation statements on the manufacturer's website. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) SKW-3 cells were provided by Mark Davis' lab (Stanford). Lenti-X 293T cell line was acquired from Takara (cat #632180). 


Authentication Verification of TCR deficiency in SKW-3 cells line was conducted by flow cytometry against TCRab. 293T cells were not 
authenticated, but functionally produced lenti viruses. 


Mycoplasma contamination Negative for mycoplasma 
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Commonly misidentified lines SKW-3 cells are commonly misidentified as KE-37 cells. Our cells were verified to lack TCRs and functionally behave like T cells 
(See ICLAC register) when transduced with T cell receptors. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals APP/PS1 mice expressing a chimeric mouse/human mutant amyloid precursor protein (Mo/HuAPP695swe) and a mutant human 
presenilin 1 (PS1-dE9) directed to CNS neurons under the prion protein promoter were used (The Jackson Laboratory). Mice 
were housed at the Paracelsus Medical University Salzburg in groups under standard conditions at a temperature of 22°C anda 
12 h light/dark cycle with ad libitum access to standard food and water. Over 10 mice aged 9-10 months comprising both sexes 
were randomly analyzed in an unblinded manner. Animal care, handling, genotyping and experiments were approved by 
Paracelsus Medical University Salzburg ethical committees. 


Wild animals The study did not involve wild animals. 


Field-collected samples The study did not involve samples collected from the field. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics A total of 164 subjects from four separate cohorts were used in this study. The 164 subjects used in this study included 97 
healthy, 31 MCI, 28 AD and 8 PD. The average age for healthy subjects was 72.52 + 6.96 (meantstandard deviation), for MCI 
subjects: 70.97 + 7.82, for AD subjects: 70.74 + 7.01 and for PD subjects: 67.25 + 7.01. Average cognitive score for healthy 
subjects was 27.17 + 2.32, MCI: 23.39 + 4.31, AD: 11.05 + 7.32 and PD: 27.29 + 1.70. All subjects were free from acute infectious 
diseases and in good physical condition. Patients were confirmed to have neurodegeneration by measurement of biomarkers, 
including neurofilament, AB and tau (Quanterix). Group characteristics, including demographic, genetic, clinical and biomarker 
data for each group are presented in Extended Data Table 10. 


Recruitment Study subjects were recruited through Stanford's NIH-funded Alzheimer's Disease Research Center (ADRC), the UC San Diego 
ADRC and the UC San Francisco ADRC by general solicitation. The racial makeup of study subjects reflects that of the Stanford 
area and should be considered when analyzing these results. 


Flow Cytometry 
Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


ethodology 


Sample preparation PBMCs were isolated from blood by layering diluted blood (1:1 in PBS) on top of an equal volume of Ficoll, followed by 
centrifugation and isolation of the buffy coat. PBMC samples were stored over the course of a three-year period. Average PBMC 
viability for patient samples following sample thawing was 79% while the average viability for control samples was 77%. 


CSF was collected by lumbar puncture, then centrifuged at 300G to pellet immune cells. CSF samples were checked for blood 


contamination by resuspending the pelleted cells in 100 yl of CSF and mixing 10 pl (10%) CSF with 10 ul trypan blue to assess red 
blood cell content and viability. Cells were visualized on a TC20 automated cell counter (BioRad) and cell viability and presence/ 
absence of red blood cells was recorded. CSF samples contaminated with blood were removed from the study. The resuspended 
cells were then mixed with 900 ul Recovery Cell Culture Freezing Medium (Thermo Fisher). All Samples were frozen overnight at 
-80°C in a Mr. Frosty freezing container (Thermo Fisher) and transferred the following day to liquid nitrogen for storage. CSF 
samples were stored <6 months prior to analysis. 

Instrument BD LSRFortessa 

Software Cytobank v6.3.0.5 


Cell population abundance — 10,000 live leukocytes were gated for downstream SPADE analysis for mass cytomtery. 
For peripheral TEMRA scRNAseq, 50,000 peripheral cells were sorted then loaded into a 10X controller at 1,000 cells/ul. 
For CSF plate-seq, cells were sorted into individual wells of a 96-well plate. 


For CSF cell drop-seq, all cells from each CSF sample were live sorted, pelleted then loaded into the 10X controller. 
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Since CSF cells are in low abundance (approximately 1 cell per ul), processing of samples (i.e. repetitive centrifugation) was 
carefully limited to achieve greater numbers of cells. 


Gating strategy Cells were distinguished from debris by SSC/FSC, then signlets were selected on a FSC-H/FSC-W plot. Cells doubly positive for 
CD3 and CD8 were gated (~10%3 expression in each channel) 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type T1 weighted MRI Scans were acquired using Axial 3D fast spoiled gradient sequence (GE Discovery 750). 
Design specifications n/a (this is structural MRI data, not functional) 


Behavioral performance measures n/a 


Acquisition 
maging type(s) Structural 
Field strength 3 Tesla 
Sequence & imaging parameters The imaging parameters were optimized for gray/white matter tissue contrast with a repetition time of 5.9ms, echo 
time 2ms, flip angle 15, field-of-view 220mm, matrix size 256 x 256, slice thickness Imm and 2 NEX. 
Area of acquisition Whole brain 
Diffusion MRI Used Not used 


Preprocessing 


Preprocessing software All the suggested T1-weighted scans use GE's "BRAVO" sequence. It is an IR-prep, fast SPGR sequence with parameters 
tuned to optimize brain tissue contrast. 


Normalization To correct for sex differences, we normalized all volumetric measurements to total intracranial volume for each patient. 


Normalization template MNI305 atlas was used for normalization. First stage was affine registration, followed by initial volumetric labeling. The 
variations due to biasing were corrected. Finally, a high dimensional nonlinear volumetric alignment to the MNI305 atlas 
was performed, and once preprocessing completed, the volume was labeled. 

Noise and artifact removal n/a These are structural data, not fMRI data 


Volume censoring n/a These are structural data, not fMRI data 


Statistical modeling & inference 


Model type and settings Comparisons between groups of subjects were performed using multivariate analysis of covariance, where age served 
as a covariate. 


Effect(s) tested Increase/decrease in brain region volume 


Specify type of analysis: Whole brain ROI-based Both 


Subcortical segmentation including both left and right of Cerebral White Matter, Cerebral Cortex, Lateral 
Ventricle, Inferior Lateral Ventricle, Cerebellum White Matter, Cerebellum Cortex, Thalamus, Caudate, 
Putamen, Pallidum, Hippocampus, Amygdala, Lesion, Accumbens area, Vessel, Third Ventricle, Fourth 
Ventricle, Brain Stem, Cerebrospinal Fluid 


Anatomical location(s) 


Statistic type for inference n/a 
(See Eklund et al. 2016) 


Correction n/a 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


Graph analysis 


Multivariate modeling or predictive analysis 
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Clinical trials using adult stem cells to regenerate damaged heart tissue continue 

to this day!”, despite ongoing questions of efficacy and a lack of mechanistic 
understanding of the underlying biological effect*. The rationale for these cell therapy 
trials is derived from animal studies that show a modest but reproducible 
improvement in cardiac function in models of cardiac ischaemic injury**. Here we 
examine the mechanistic basis for cell therapy in mice after ischaemia-reperfusion 
injury, and find that—although heart function is enhanced-—it is not associated with 
the production of new cardiomyocytes. Cell therapy improved heart function 
through an acute sterile immune response characterized by the temporal and regional 
induction of CCR2* and CX3CRI* macrophages. Intracardiac injection of two distinct 
types of adult stem cells, cells killed by freezing and thawing or a chemical inducer of 
the innate immune response all induced a similar regional accumulation of CCR2* and 


CX3CRI‘ macrophages, and provided functional rejuvenation to the heart after 
ischaemia-—reperfusion injury. This selective macrophage response altered the 
activity of cardiac fibroblasts, reduced the extracellular matrix content in the border 
zone and enhanced the mechanical properties of the injured area. The functional 
benefit of cardiac cell therapy is thus due to an acute inflammatory-based wound- 
healing response that rejuvenates the infarcted area of the heart. 


Initial animal studies with adult stem cells reported improved heart 
function through the formation of new cardiomyocytes by transdif- 
ferentiation of the injected cells®’”. However, adult stem cell transdif- 
ferentiation was not observed in later studies***, and clinical trials 
using adult stem cells in patients with acute myocardial infarction 
injury or decompensated heart failure have been indeterminate”. 
The mechanistic basis for cell therapy therefore remains unclear, 
although a paracrine hypothesis has previously been proposed”. 
Here we focused on two types of primary adult stem cell: fraction- 
ated bone marrow mononuclear cells (MNCs)—which have extensively 
been used in human clinical trials*—and cardiac mesenchymal cells 
that express the receptor tyrosine kinase KIT, originally termed and 
referred to hereas cardiac progenitor cells (CPCs)”"°. We also examined 
the effect of injecting zymosan, anon-cellular and potent activator of 
the innate immune response”. Isolated MNCs were a heterogeneous 
cell population that comprised all the major haematopoietic lineages— 
although monocytes and granulocytes were predominant (Extended 
Data Fig. 1a). CPCs expressed mesenchymal cell-surface markers but 
were negative for markers of haematopoietic or endothelial cells 
(Extended Data Fig. 1b). 

Uninjured eight-week-old male and female C57BI/6) mice received 
an intracardiac injection of strain-matched MNCs, zymosan or saline 
(Fig. 1a). Histological foci of acute inflammation were observed within 


areas of cell or zymosan injection, as examined by confocal microscopy 
from heart sections three days, seven days or two weeks post-injection 
(Fig. 1b). Activated CD68* macrophages were significantly increased 
within the area of injection after three and after seven days, witha 
diminishing effect by two weeks as the cells or zymosan were cleared 
(Fig. 1b, c). We did not observe any differences in neutrophil levels from 
dissociated hearts at three days after injection (Extended Data Fig. 1c). 

To further profile the induction of macrophages owing to injection 
of MNCs or zymosan, we used Ccr2-RFP” and Cx3cr1-GFP® knock-in 
mice to broadly distinguish the major subtypes of macrophage inthe 
heart'*’’. We delivered unlabelled MNCs or zymosan into eight-week-old 
Ccr2-RFP x Cx3cr1-GFP mice by intracardiac injection (Fig. 1d). Uninjured 
(that is, noninjected) adult hearts showed GFP* (CX3CRI1*) macrophages 
throughout the myocardium, whereas RFP* (CCR2*) macrophages 
were largely absent at baseline (Fig. le). After one day, areas of the 
heart injected with MNCs or zymosan showed a robust accumulation 
of CCR2* macrophages, and CX3CRI1* macrophages began to expand in 
the periphery of the injected area (Fig. le). At three days, these CX3CR1* 
macrophages had also expanded within the injection area along with the 
CCR2* macrophages; this began to resolve by seven days after injection. 
Flow cytometry analysis from these mice at three days after injection 
also indicated a shift in the overall content of the macrophage subtypes 
froma largely CX3CRI*CCR2° population in the naive state to a mix of 
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University of Cincinnati, Cincinnati, OH, USA. “Center for Systems Biology, Department of Imaging, Cardiovascular Research Center, Massachusetts General Hospital of the Harvard Medical 
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Fig. 1| Cardiac cell injection causes local inflammation with the 
accumulation of distinct subtypes of macrophage. a, Experimental scheme 
using eight-week-old male and female C57Bl/6J mice subjected to intracardiac 
injection of strain-matched MNCs, Alexa Fluor 594-conjugated zymosan (zym.) 
or sterile saline (sal.). Sham-treated mice received thoracotomy, but no 
intracardiac injection. MNCs were isolated from Rosa26-mTomato mice on the 
C57Bl/6J background. b, Representative confocal immunohistochemistry 
micrographs of hearts showing activated CD68 macrophages (green), or the 
injected MNCs or zymosan (red). Dashed lines show injection sites. Images are 
froma minimum of 18 histological sections per mouse heart, assessed from 
n=3 (saline at 3 days post-injection (PI), saline at 14 days post-injection and 
MNCsat 14 days post-injection) or n=4 mice (all other groups). Scale bars, 

100 um. c, Quantification of CD68" cells as a percentage of total cells (DAPI'), 
imaged at areas of injection from the groups described inb. Sample sizes for all 
groupsare listed above in b. At 3 days, *P< 0.0001 versus sham, “P= 0.0003 
(MNC versus saline), “P= 0.0015 (zymosan versus saline). At 7 days, *P< 0.0001 
(MNC versus sham), *P= 0.0005 (zymosan versus sham), *P< 0.0001 (MNC 
versus Saline). Significance was determined by one-way analysis of variance 
(ANOVA) with Tukey’s post hoc test. Data are summarized as box-and-whisker 
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enhanced numbers of CCR2* and CCR2*CX3CRI‘ (double-positive) 
macrophages with injection of MNCs or zymosan (Fig. 1f). 

We also used immunohistochemistry to evaluate the ability of 
injected MNCs or zymosan to induce the formation of new cardio- 
myocytes (PCM1°) in the heart (Fig. 2a, b). We observed no appreciable 
increase in cardiomyocyte cell-cycle activity (Ki67*) versus saline- 
injected controls at the areas of injection, or distally across the entire 
tissue (Fig. 2c). Another proposed effect of cell therapy is the activation 
of endogenous CPCs; it has previously been determined that these can 
contribute to cardiac endothelial cell content after myocardial infarc- 
tion injury’®. We used atamoxifen-inducible, Kit™"""* x Rosa26-eGFP 
lineage-tracing mouse strategy to examine endothelial cell formation 
from endogenous CPCs after injection of cells or zymosan (Figs. 2d, e). 
Notably, we did not observe transdifferentiation of injected MNCs or 
CPCs into cardiomyocytes or endothelial cells (Fig. 2f). However, two 
and six weeks after injection, eGFP* endothelial cells were significantly 
increased at the injection sites in zymosan-treated hearts, but not in 
hearts injected with MNCs or CPCs (Fig. 2g, h). Zymosan persisted the 
longest within these hearts, whereas CPCs and MNCs were essentially 
cleared by two weeks after injection (Fig. 2e), which potentially explains 
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plots, indicating the median value (black bar inside box), 25th and 75th 
percentiles (bottom and top of box, respectively), and minimum and maximum 
values (bottom and top whisker, respectively). d, Experimental scheme using 
eight-week-old male and female Ccr2-RFP x Cx3cr1-GFP knock-in mice to 
simultaneously visualize CCR2° and CX3CRI’ subtypes of macrophage, after 
injection of MNCs or zymosan. e, Representative confocal micrographs from 
cardiac histological sections from MNC- or zymosan-injected mice, versus 
naive (non-injected) controls (minimum of 30 sections assessed per mouse 
heart fromn=2 naive control mice andn=3 MNC- or zymosan-injected mice), 
showing endogenous RFP and GFP immunofluorescence from CCR2’ or 
CX3CRI' cells, respectively, at the injection site over a seven-day time course. 
Scale bars, 100 pm. f, Distribution of CCR2° and CX3CRI subtypes of 
macrophage in hearts at three days after injection. Pie charts reflect the 
proportion of RFP (CCR2")- or GFP (CX3CRI1")-expressing cells, as well as 
CCR2*CX3CRI‘ double-positive (yellow) cells detected by flowcytometry, asa 
percentage of total macrophages identified by staining for F4/80 and CD64. 
Data are fromn=6 MNC- and zymosan-injected mice or n=2 naive (non- 
injected) mice. 


why zymosan was more effective. None of the treatments increased 
KIT*-derived endothelial cells in the distal areas of the heart. 

We next injected strain-matched MNCs, CPCs, zymosan or saline on 
each side of the infarct border zone in C57Bl/6J mice one week after 
ischaemia-reperfusion (I-R) injury (Fig. 3a). Importantly, the injection 
of cells or zymosan into uninjured hearts did not alter left ventricular 
structure or function (Extended Data Fig. 2a-f). Injection of MNCs, 
CPCs or zymosan each significantly improved cardiac ventricular per- 
formance after I-R by two weeks after their injection, when compared 
with saline-injected controls (Fig. 3b). Of note, although the intracar- 
diac injection procedure itself (saline) produced a mild inflammatory 
response (Extended Data Fig. 3a-e), it did not improve cardiac function 
after I-R (Extended Data Fig. 3f-i). Injection of cells or zymosan was also 
associated with improvements in left ventricular end-systolic volume, 
consistent with better cardiac function (Extended Data Fig. 4a). By 
contrast, there was nochange in left ventricular end-diastolic volume 
(Extended Data Fig. 4b) or heart rate (Extended Data Fig. 4c) across 
any of the treatment groups at two weeks after therapy. Importantly, 
the observed functional benefit persisted for at least eight weeks after 
the injection of MNCs or zymosan (Fig. 3c). 
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Fig. 2| Cell or inflammatory therapy induces endothelial cell, but not 
cardiomyocyte, formation. a, Schematic of experiments performed inb, cin 
eight-week-old male and female C57BI/6J mice that received intracardiac 
injection of MNCs, zymosan or saline, and were analysed two weeks later. 

b, Representative cardiac immunohistochemistry for Ki67 (green) and PCM1 
(purple) from MNC-injected hearts. DAPI (blue) shows nuclei. A minimum of 

45 histological sections were analysed per mouse heart fromn=4 MNC-treated 
mice, orn=5 mice for all other groups. The yellow box denotes an area shown at 
ahigher magnification in the insets on the right. The yellow arrowhead denotes 
acardiomyocyte with cell-cycle activity. Scale bars, 100 pm (left), 10 pm (right). 
c, Quantification of cardiomyocytes with cell-cycle activity (PCM1°Ki67*) asa 
percentage of all cardiomyocytes imaged (PCM1*) at two weeks after injection. 
Data are froma minimum of 45 cardiac histological sections analysed per 
mouse. n=4 (MNC andat the injection site) or n=5 mice for all other groups. 

d, Schematic of experiments performed in e-h using KIT lineage-tracing mice 
(Kit@"™""* x Rosa26-eGFP) injected with MNCs, CPCs, zymosanor saline, then 
analysed two or six weeks later. Tamoxifen was administered continuously (in 
chow), starting one day before cell injection. e, Representative confocal 


We next treated mice with a high dosage of the broad-spectrum 
immunosuppressant cyclosporine A (CsA), which abrogated the 
restorative effects on cardiac function that were seen with injections 
of MNCs or zymosan after I-R injury (Fig. 3d). Administering clodronate 
liposomes acutely depleted macrophages and abolished the protec- 
tive effect of the MNC injection after I-R in a similar manner to CsA 
treatment (Fig. 3e). Finally, injecting cellular debris from MNCs killed 
by freezing and thawing also improved cardiac function after I-R 
(Extended Data Fig. 4d). 

Cardiac I-R injury itself is associated with a robust, temporal recruit- 
ment of discrete myeloid cell populations”, which we also observed 
using Ccr2-RFP x Cx3cr1-GFP mice (Fig. 3f). Of note, characterization of 
Ccr2-RFP x Cx3cr1-GFP mice at baseline using flow cytometry showed 
that over 90% of the CX3CRI°CCR2 cells in the heart were tissue-res- 
ident CD169* macrophages", with a minor contribution of dendritic 
cells and neutrophils (Extended Data Fig. 5a-f). We also used Ccr2” or 
Cx3cr1’ gene-targeted mice, and—although initial infarct sizes after 
I-Rwere not different among Ccr2” or Cx3cr1’ mice and their strain- 
matched wild-type controls (not shown)—Ccr2 deficiency partially 
improved cardiac function after I-R (Fig. 3h), consistent with previous 
reports’. Moreover, cell therapy by MNC injection in mice that lack 
Ccr2 imparted no further functional benefit beyond the improvement 
seen in these mice after I-R (Fig. 3h). The loss of Ccr2led to a reduction 


immunohistochemistry images from hearts showing CD31’ endothelial cells 
(white) and injected MNCs, CPCs or zymosan (red). eGFP (green) shows Kit- 
allele-derived endothelial cells. Yellow arrowheads denote CD31’ endothelial 
cells that are also eGFP’. Scale bars, 100 um. f, Larger insets of images shownin 
e, (within the boxed areas). Injected MNCs (top) or CPCs (bottom, rotated 90°) 
are shown with red arrowheads indicating mTomato’ cells that are negative for 
CD31 and lack known cardiomyocyte morphology. Scale bars, 20 im. 

g,h, Quantification of the percentage of eGFP* endothelial cells relative to total 
endothelial cells counted, either two weeks (g) or six weeks (h) post-injection. 
All data in e-hare fromn=6 (six weeks after zymosan injection) orn=5 mice 
(all other groups). All Pvalues ing, h were determined by one-way ANOVA with 
Tukey’s post hoc test. All numerical data in this figure (c, g, h)are summarized 
as box-and-whisker plots, indicating the median value (black bar inside box), 
25th and 75th percentiles (bottom and top of box, respectively), and minimum 
and maximum values (bottom and top whisker, respectively). Dataand 
representative micrographs in e-h are froma minimum of 45 histological 
sections analysed per individual mouse heart from the numbers of mice 
indicated above. 


in overall CD68* cell content, specifically at the border zone of hearts 
after |-R—both with and without cell therapy (Fig. 3i). By comparison, 
Cx3cr1-null mice showed left ventricular dysfunction after I-R injury 
that was similar to wild-type controls, but these Cx3cr1-null mice no 
longer benefitted from MNC therapy and showed a much greater 
total inflammatory response (Fig. 3h, i). This result is consistent with 
arecent study that has shown that ablation of CX3CRI* macrophages 
increases mortality and peri-infarct fibrosis after myocardial infarction 
in mice’. However, Cx3cr1-null mice show a GFP” (expressed from the 
Cx3cr1-GFP allele) macrophage content after permanent occlusion 
myocardial infarction injury that is similar to that of control mice, as 
wellas the same content of activated CD68* macrophages in the infarct 
border zone at three days after myocardial infarction (Extended Data 
Fig. 6a-c). This observation is consistent with previous analyses of 
Cx3cr1-null mice, in which monocyte extravasation during peritonitis 
is unaffected” and trafficking of tissue macrophages from the yolk 
sac during developmentis unaltered”°. Given these previous observa- 
tions, the most likely explanation for our results is that CX3CRI defi- 
ciency does not compromise tissue-resident macrophage content, but 
instead affects the function of these macrophages-—resulting in greater 
tissue inflammation. Finally, the acute monocyte and neutrophil 
responses at three days after myocardial infarction were also not dif- 
ferent between Cx3cr1-null mice and heterozygous Cx3cr1-GFP controls 
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Fig.3| Cell or inflammatory therapy rejuvenates heart function after I-R. 

a, Schematic of experiments performed inb, c, in which eight-week-old male and 
female C57BI/6J mice received 120 min of myocardial I-R injury, then (1 week 
later) intracardiac injection of MNCs, CPCs, zymosan orsterile saline flanking 
the injury area, followed by analysis two or eight weeks later. b, Fractional 
shortening (FS) as measured by echocardiography in the groups indicated, two 
weeks after cell or zymosan therapy. *P< 0.0001 (saline versus sham), 

*P= 0.0002 (CPC versus sham), *P= 0.0019 (zymosan versus sham), *P<0.0001 
(MNC versus saline), *P= 0.0157 (CPC versus saline), *P= 0.0019 (zymosan versus 
saline). c, Fractional shortening at eight weeks after cell or zymosan therapy. 
*P<0.0001 (saline versus sham), *P=0.0002 (MNC versus sham), *P= 0.0086 
(zymosan versus sham), *P= 0.0194 (MNC versus saline), *P= 0.0005 (zymosan 
versus Saline). Significance in b, cwas determined by one-way ANOVA with 
Dunnett’s post hoc test. The same sham group is showninbandc, as these 
experiments were performed in parallel. d, Fractional shortening in post-I-R 
male and female mice that received CsA (15 milligrams per kilogram body 

weight per day) delivered by osmotic minipump, starting one day before MNC, 
zymosan or saline injection and continuing for two weeks after injection. 
*P=0.0004 (saline versus sham), *P= 0.0006 (MNC versus sham), *P=0.0018 
(zymosan versus sham), all by one-way ANOVA with Dunnett’s post hoc test. 

e, Fractional shortening in post-I-R male and female mice that received two 
injections of clodronate liposomes delivered intraperitoneally one day before 
MNC injection, and again five days after MNC injection. No difference between 
control and liposome treatment was observed in mice that received 
intracardiac injection of saline after I-R, so these groups were combined 
(denoted as saline). *P< 0.0001 (saline versus sham, or MNC + liposomes versus 
sham), “P= 0.0276 (MNC versus saline), *P= 0.0042 (MNC + liposomes versus 
MNC), all determined by one-way ANOVA with Dunnett’s post hoc test. 

f, Confocal micrographs of histological sections at the infarct border zone of 
hearts from male and female Ccr2-RFP x Cx3cr1-GFP knock-in mice (n=2 mice per 
group and time point, with a minimum of 10 sections assessed per mouse heart) 
at either three days or two weeks after I-R. Scale bars, 100 pm. g, Schematic of 
experiments performedinh, iin male and female Ccr2” or Cx3crI” mice inthe 
C57BI/6J background that were subjected to I-R, and then injected with MNCs or 
sterile saline one week later. h, Fractional shortening in Ccr2” or Cx3cr1’ mice 
or wild-type (WT) C57BI/6J mice two weeks after cell injection, or non-injured 
mice (pre). *P< 0.0001 (wild type, saline versus pre), *P= 0.0004 (Ccr2”, MNC 
versus pre), *P=0.0001 (Cx3crI”, saline versus pre), *P=0.0002 (Cx3cr1’, MNC 
versus pre), *P=0.0030 (wild type, MNC versus saline), all by one-way ANOVA 
with Tukey’s post hoc test. i, Quantification of CD68" cells as a percentage of 
total cells (DAPI*) imaged at the infarct border zone, three weeks after I-R. 
*P=0.0001 versus wild type and saline, *P< 0.0001 versus Ccr2” and saline, all 
by one-way ANOVA with Tukey’s post hoc test. The number of mice in each of the 
experimental groups is indicated below or within the respective graphs. All 
numerical data are summarized as box-and-whisker plots, indicating the median 
value (black bar inside box), 25th and 75th percentiles (bottom and top of box, 
respectively), and minimum and maximum values (bottom and top whisker, 
respectively). 
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(Extended Data Fig. 6d). However, we observed significantly greater 
mortality after myocardial infarction in Cx3cr1-null mice (Extended 
Data Fig. 6e), which suggests that CX3CRI‘ cells have animportant role 
inthe later stages of infarct maturation (after day 3) and/or inthe modu- 
lation of the fibrotic response, as has also recently been proposed» 

Mechanistically, extracellular matrix content in the peri-infarct bor- 
der zone was noticeably decreased with injection of MNCs around the 
I-R injury area (Fig. 4a, b). We also observed this decrease with the 
injection of non-viable MNCs, which suggests that it was primarily due 
to immunoreactivity—and not to active paracrine signalling (Fig. 4c). 
Notably, infarct strips from MNC-injected hearts produced a signifi- 
cantly greater change in passive force over increasing stretch, a profile 
that was more similar to that of the uninjured heart (change in initial 
length (L,)) (Fig. 4d). This profile was also associated with a decrease in 
gene expression of several components of the extracellular matrix and 
genes that underlie the fibrotic response in MNC- versus saline-treated 
hearts after I-R injury (Fig. 4e). We repeated the force-lengthening 
assay on infarct strips from post-I-R hearts injected with zymosan, 
which showed an even larger improvement in passive force dynamics 
compared with the saline or MNC treatment (Extended Data Fig. 7a). 

We next isolated bone marrow-derived macrophages or peritoneal 
macrophages from naive mice and cultured them on prefabricated 
collagen patches, followed by second harmonic generation micros- 
copy to examine collagen organization (Fig. 4f). Bone marrow-derived 
macrophages and peritoneal macrophages each generated different 
patterns of collagen reorganization. To extend these observations, we 
used a collagen hybridizing peptide” reagent, which detects imma- 
ture or denatured collagen and areas of active remodelling, on injured 
mouse hearts (Extended Data Fig. 7b). Hearts from MNC- or zymosan- 
treated mice showed reactivity to the collagen hybridizing peptide that 
was coincident with regional localization of CCR2* versus CX3CRI1* 
macrophages within the microenvironment of the infarct border zone 
(Extended Data Fig. 7c). Finally, we isolated CCR2* or CX3CRI* mac- 
rophages from hearts at seven days after I-R (Fig. 4g) and cultured them 
with freshly isolated cardiac fibroblasts for 72 h. Gene expression analy- 
sis revealed that CCR2* macrophages increased fibroblast expression 
of smooth muscle a-actin (Acta2), lysyl oxidase (Lox) and collagen type 
1a 2 (Colla2) (Fig. 4h, i,k). By contrast, CX3CRI1* macrophages slightly 
reduced the expression of these genes but increased fibroblast expres- 
sion of connective tissue growth factor (Ctgf, also known as Ccn2)” 
(Fig. 4j). Together, these results demonstrate that specific subtypes of 
macrophage mobilized by cell therapy differentially affect the passive 
mechanical properties of the cardiac infarct area by influencing the 
activity of cardiac fibroblasts. 

As suggested over a decade ago”, we observed that the acute inflam- 
matory responseisa primary beneficial effect that underlies cell ther- 
apy in the injured heart after myocardial infarction. We identified a 
mechanism by which temporary stimulation of the intrinsic wound- 
healing cascade and select subtypes of macrophage positively affect 
the extracellular matrix around and within the infarcted region of the 
heart, such that functional performance is improved. These results 
are consistent with recent reports that demonstrate key functional 
distinctions between CCR2* and CX3CR1*CCR2 macrophages in car- 
diac wound healing”. In conclusion, our data suggest a need for the 
re-evaluation of current and planned clinical trials based on cardiac 
cell therapy to maximize the effects of the most prevalent underlying 
biological mechanism of action. 
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Fig. 4| Cell therapy benefits the mechanical properties of the infarct via 
remodelling of the extracellular matrix. a, Schematic of experiments performed 
inb-e. b, Representative cardiac histological images stained with picrosirius red 
from the infarct border zone of mice three weeks after I-R, subjected to MNC or 
saline injection. Fibrosis is shown inred. Scale bars, 100 pm. c, Quantification of 
fibrotic area at the infarct border zone in hearts treated with MNCs, MNCs killed 
by freezing and thawing (-Fz) or saline, three weeks after I-R. Pvalues are shownin 
the panel and were calculated by one-way ANOVA with Tukey’s post hoc test. 
Images in band quantification inc are fromn=5saline-treated mice, n=12 MNC- 
treated mice orn=7 mice treated with MNCs killed by freezing and thawing, witha 
minimum of 20 histological sections assessed from each individual mouse heart. 
d, Change in passive force generation over increasing stretch lengthening (per 
cent ofL,) inisolated infarct strips from MNC- or saline-treated hearts, three 
weeks after I-R. Exact Pvalues are shown in the panel versus I-Rand saline, and 
were calculated by Student’s two-tailed t-test. Data from one untreated control 
heart (noI-Ror cell therapy) are shown for comparison. e, mRNA expression levels 
by PCR with reverse transcription (RT-PCR) for selected genes associated with 
fibrosis and the extracellular matrix inisolated infarct regions from MNC or saline- 
treated hearts, three weeks after I-R. Exact Pvalues are shownin the panel versus 
I-Randsaline, by Student’s two-tailed t-test. f, Representative confocal 
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Methods 


Mice 

This study was performed entirely in mice, using transgenic models that 
are commercially available as described later in this section. Nohuman 
subjects or human material were used. All experiments involving mice 
were approved by the Institutional Animal Care and Use Committee 
(IACUC) at Cincinnati Children’s Hospital under protocol IACUC2018- 
0047. All procedures were performed in compliance with institutional 
and governmental regulations under PHS Animal Welfare Assurance 
number D16-00068 (A3108-01). The generation and characterization 
of mice carrying the tamoxifen-inducible MerCreMer recombinase 
cDNA within the Kit allele (Kit), and reporter mice carrying 
the Cre-regulated loxP-stop cassette and eGFP within the Rosa26 gene 
locus, Rosa26-eGFP (R-GFP), have previously been described”®. All the 
other mouse strains were purchased from The Jackson Laboratory, as 
follows: C57BI/6J; (no. 000664), constitutive mTomato-expressing 
mice targeted in the Rosa26 locus for MNC or CPC isolation; (B6.129(Cg)- 
Gt(ROSA)26Sortm4(ACTB-tdTomato, -EGFP)Luo/J, no. 007676), Ccr2- 
gene-deleted mice; (B6.129$4-Ccr2tmIlfc/, no. 004999), Cx3cr1-GFP 
knock-in mice (Cx3cr1 homozygotes are nulls); B6.129P-Cx3critmILitt/, 
no. 005582) and Ccr2RFP knock-in mice; (B6.129(Cg)-Ccr2tm2. lfc/J, no. 
017586). Both male and female mice were used in all experiments, at age 
ranges indicated in the figures and text for each experiment. Mice were 
housed with mice of the same sex at amaximum of 4 mice per cage ina 
specific-pathogen-free, temperature-controlled vivarium under a12-h 
light-dark cycle with ad libitum access to food and water. Randomiza- 
tion was not performed, because mice are genetically identical, housed 
together and of the same age ranges and sex ratios. 


Preparation of cell or inflammatory therapies 

To generate MNCs for injection, whole bone marrow was first isolated 
by flushing dissected femurs and tibiae of 10-12-week-old Rosa26-mTo- 
mato-expressing mice or C57BI/6J mice with 10 ml of sterile Hanks Bal- 
anced Salt Solution (HBSS, Fisher Scientific no. SH3058801) + 2% bovine 
growth serum (BGS, Fisher Scientific, no. SH3054103) + 2mM EDTA, 
as previously described. This suspension was then filtered througha 
40-"m mesh strainer (Fisher Scientific no. 22-363-547), centrifuged at 
400g for 10 minat 4 °C, resuspended in3 ml of sterile saline and layered 
ontop of 4 ml of Ficoll Paque Plus (GE Healthcare no. 17-1440-02). Cells 
were then centrifuged at 2,500g for 30 min at 4 °C inaswinging bucket 
rotor centrifuge without brakes. MNCs were isolated by removal of the 
resulting thin mononuclear cell layer (the second layer from the top). 
Total MNCs were counted with a haemocytometer, washed twice with 
sterile saline and resuspended insterile saline at a final concentration 
of either 2.5 x 10° cells per millilitre (for injection into uninjured hearts, 
a final dose 50,000 cells) or 7.5 x 10° cells per millilitre (for injection 
into post-I-R hearts, a final dose 150,000 cells). A higher dose of cells 
(MNCs or CPCs, see ‘Mouse procedures’) was used in post-I-R hearts 
to account for greater cell loss inthe setting of damaged myocardium 
and toalign with previous studies*». The full intracardiac injection pro- 
cedure is described in ‘Mouse procedures’. Cell viability was tested by 
incubating an aliquot of the MNC suspension with eFluor 450 Fixable 
Viability Dye (eBioscience no. 65-0863-18) and was found to be over 
90% viable at the time of injection. All MNC preparations for injection 
were combined from an equal number of male and female mice. For 
experiments that used non-viable MNCs (frozen and thawed), this 
final MNC suspension was split into two equal aliquots. One aliquot 
was placed immediately at —80 °C for 10 min, followed immediately 
by incubation at 55 °C for 10 min, and this was repeated for a total of 
3 freeze-thaw cycles. 

To generate CPCs for injection, hearts from 10-12-week-old Rosa26- 
mTomato-expressing male and female mice were rapidly excised and 
briefly rinsed in cold 1x PBS. Single-cell suspensions from these hearts 
were prepared according to previously published protocols”>”* with 


minor modifications. The atria were removed, and the ventricles were 
minced onice using surgical scissors into approximately 2-mm pieces 
(8-10 pieces per mouse heart). Each dissociated ventricle was trans- 
ferred into 2 ml of digestion buffer in 1 well of a 12-well tissue culture 
plate. Digestion buffer consisted of 2mg/ml collagenase type IV (Wor- 
thington, no. LSO04188), 1.2 U/ml dispase II (Roche, no. 10165859001) 
and 0.9 mM CaCl, in1x HBSS. Tissues were incubated at 37 °C for 20 min 
with gentle rotation followed by manual trituration 12-15 times witha 
10-ml serological pipette, such that all the tissue pieces were able to pass 
through the pipette. The tissues were settled by sedimentation and the 
supernatant was passed through a40-pm mesh strainer and stored on 
ice. Two millilitres of fresh digestion buffer was added, followed by 2 
additional rounds of incubation, trituration and replacement of super- 
natant with fresh digestion buffer, except trituration was performed 
with a 5-ml serological pipette for round 2 and a1-ml p1000 pipette 
tip (USA Scientific, no. 1112-1720) for round 3. The pooled supernatant 
from the 3 rounds of digestion was washed with sterile PBS and centri- 
fuged at 200g for 20 min at 4 °C in a swinging bucket rotor centrifuge 
without brakes. The pellet was resuspended in flow cytometry sorting 
buffer, consisting of 1x HBSS supplemented with 2% BGS and 2 mM 
EDTA, and incubated with anti-KIT microbeads (Miltenyi Biotec no. 
130-091-224) for 20 min at 4 °C with gentle rotation. The suspensions 
were washed twice with sorting buffer and KIT* cells were enriched via 
positive selection over Miltenyi Biotec LS columns (no. 130-042-401) 
using benchtop magnetic cell separation (MACS) according to the 
manufacturer’s instructions. Cells isolated via positive MACS selec- 
tion for KIT were cultured in DMEM-F12-Glutamax medium (Gibco 
no. 10565-018) supplemented with 10% fetal bovine serum (Sigma, no. 
F2442), 0.2% insulin-transferrin—selenium (Lonza, no. 17-838Z),20ng/ 
ml basic recombinant human fibroblast growth factor (Promega, no. 
G5071), 10? U/ml leukaemia inhibitor factor (Millipore, no. ESG1106), 
20 ng/ml epidermal growth factor (Sigma, no. E9644) and 1% penicil- 
lin-streptomycin (Fisher Scientific, no. 30-002-Cl). 

KIT’ isolated cells from the heart are referred to as CPCs (as previously 
named), although recent data would suggest that these cells transdiffer- 
entiate only into endothelial cells, and not cardiomyocytes, in vivo?*"*. 
KIT* isolated cells were expanded in culture for 12-15 passages before 
being used for injection, at which point cells were washed 3 times with 
sterile saline, trypsinized, counted and resuspended in sterile saline 
at a final concentration of either 2.5 x 10° cells per millilitre (for injec- 
tion into uninjured hearts, a final dose 50,000 cells) or 7.5 x 10° cells 
per millilitre (for injection into post-I-R hearts, a final dose 150,000 
cells). As with MNCs, suspensions comprised pooled CPCs from an 
equal number of male and female mice. 

Alexa Fluor 594-conjugated zymosan, or unconjugated zymosan, 
were purchased from Thermo Fisher (zymosan A (Saccharomyces cer- 
evisiae) BioParticles, Alexa Fluor 594 conjugate, no. Z-23374, zymosan 
A(S. cerevisiae) BioParticles, unlabelled, no. Z2849). A suspension of 
either 1 mg/ml (for injection into uninjured hearts, a final dose of 10 pg) 
or 2 mg/ml (for injection into post-I-R hearts, a final dose of 20 pg) was 
prepared for injectionin sterile saline according to the manufacturer’s 
instructions. Zymosan dosing was extrapolated from a previous study 
using injection into neonatal mouse hearts”. A higher dose of zymosan 
was used in post-I-R hearts to enable comparisons with MNC- and 
CPC-treated animals. 


Mouse procedures 

To deliver cell or inflammatory therapies by intracardiac injection, 
mice were anaesthetized using isoflurane inhalation (to effect) and 
intubated, and a left lateral thoracotomy was performed. A 25-1 gas- 
tight syringe (Hamilton, no. 7654-01) fitted with a 33-gauge needle 
(Hamilton, no. 7803-05) was used for injections. For experiments in 
mice without injury, 20 pl of MNCs at a concentration of 2.5 x 10° cells 
per millilitre (SO,000 cells in total) was injected over 3 regions of the 
left ventricle (6.7 pl per injection). For experiments with zymosan, 


10 pl of al mg/ml suspension was injected over 3 regions of the left 
ventricle (3.3 pl per injection, 10 pg zymosan in total). Injections 
were equidistant along the anterior wall of the left ventricle, with the 
needle entering just parallel to the long axis of the ventricle to avoid 
entering the left ventricular chamber. For experiments with injections 
occurring after cardiac injury, two injections were performed, oneon 
either side of the infarct zone. Twenty microlitres total of MNCs or 
CPCs at aconcentration of 7.5 x 10° cells per millilitre (150,000 cells 
total) was injected (10 pl per injection). For experiments with zymosan, 
10 pl total of a2 mg/ml suspension was injected (5 pl per injection, 20 
Lg zymosan in total). A greater number of cells and higher amount of 
zymosan were injected into the injured heart compared with the unin- 
jured (naive) heart, because of less retention and a greater turnover of 
the cells or zymosan owing to the infarction injury. The sham procedure 
for intracardiac injection consisted of anaesthesia, intubation and 
thoracotomy as performed for the previously described myocardial 
infarction procedure”. The chest was then immediately closed, and 
mice recovered. 

To induce cardiac injury, we used a modified surgical model 
of I-R via temporary left coronary artery ligation as previously 
described”, in which 120 min of ischaemia was used before induc- 
ing reperfusion, which gave more complete killing of the ischaemic 
zone and greater reproducibility. After each surgical procedure (I-R 
or intracardiac injection), animals were given post-operative analge- 
sics (buprenorphine, 0.1 milligram per kilogram body weight) and 
allowed to recover until the experimental time points indicated, at 
which point mice were then further analysed, or tissue was collected. 
For experiments with permanent-occlusion myocardial infarction, the 
same procedure was performed except the ligature was not released. 
Randomization was not performed because the mice used here are 
genetically identical, housed together and of the same age ranges and 
sex ratios. A discussion of blinding and sample elimination considera- 
tions can be found in ‘Statistical information and experimental rigour 
(blinding)’. 

Infarct size and area-at-risk after -R was determined using triphenyl 
tetrazolium chloride and Evans blue staining, as previously described”®. 
In experiments using immunosuppression via CsA, mice were anaesthe- 
tized with 2% isoflurane inhalation to effect, and osmotic minipumps 
(Alzet, no. 1002) were implanted subcutaneously on the left lateral side 
of the mouse. Minipumps were loaded with CsA (Neoral, Novartis, NDC 
0078-0274-22) dissolved in Cremophor EL (Sigma, no. C5135) such that 
15 milligrams CsA per kilogram of body weight was delivered per day”. 
In experiments using acute macrophage ablation, mice were adminis- 
tered 2 doses of 0.2 ml each of clodronate liposomes (Clophosome, no. 
F70101C-N) via intraperitoneal injection, one day before cell therapy 
and again on day 5S after cell therapy. Liposomes were kept on ice until 
administration and were rapidly mixed by inversion to deliver a uniform 
suspension. Control mice were injected with 0.2 ml of sterile saline. In 
experiments using Kit’"""* x Rosa26-eGFP genetic lineage-tracing 
mice, tamoxifen was administered as previously described” via ad 
libitum feeding with premanufactured tamoxifen chow (tamoxifen 
citrate 40 milligrams per kilogram body weight per day, Envigo, no. 
TD.1308603) for the time indicated in each individual experiment. 
For echocardiographic analysis of cardiac structure and function, 
mice were anaesthetized with 2% isoflurane inhalation to effect and 
analysed using a Vevo2100 instrument (VisualSonics) with an 18-38- 
MHz transducer as previously described*°. Randomization was not 
performed because mice are genetically identical and of the same age 
ranges. A discussion of blinding and sample elimination considerations 
can be found in ‘Mice’ and ‘Statistical information and experimental 
rigour (blinding)’. 


Histology and immunohistochemistry 
Primary antibodies and dilutions used for immunohistochemistry are 
listed in Supplementary Table 1. For histological analysis, mice were 


anaesthetized by isoflurane inhalation and killed by cervical disloca- 
tion. The chest was opened, and the heart was flushed with cold car- 
dioplegia solution (1 M KCl in 1x PBS) via cardiac apical insertion of a 
25-gauge needle. The left atrium was cut to allow drainage of blood from 
the heart, and mice were briefly perfused with cold fixative (4% para- 
formaldehyde in sodium phosphate buffer, pH 7.4) through the apex of 
the heart. Tissues were excised, flushed with fixative and incubated in 
cold fixative for 3.5 hat 4 °C with gentle rotation. Tissues were washed 
3 times in cold 1x PBS and then cryopreserved by incubation in 30% 
sucrose in 1x PBS overnight at 4 °C with gentle rotation. Tissues were 
then embedded in TissueTek optimal cutting temperature medium 
(VWR, no. 25608-930) and flash-frozen at —80 °C. Five-micrometre 
cryosections were cut using a Leica CM1860 cryostat. 

Picrosirius red staining was performed with a kit from Abcam 
(ab150681), as per the manufacturer's instructions. High-magnification 
images of hearts stained with picrosirius red were captured at 200x 
magnification using an Olympus BX51 microscope equipped with a 
single chip colour CCD camera (DP70) and DP controller software 
(Olympus America, v.3.1.1.). Border zone fibrosis was quantified as 
the percentage of picrosirius red-stained area over total tissue area 
analysed, as previously described”. 

All detection of genetic reporter-driven mTomato, RFP, GFP or eGFP 
expression was performed using endogenous fluorescence without 
antibody labelling. Immunohistochemistry was performed on cardiac 
cryosections as previously described” with the following modifica- 
tions (see Supplementary Table 1 for primary antibodies and dilutions). 
Alexa Fluor fluorochrome-conjugated secondary antibodies were used 
at.a1:200 dilution for visualization (Life Technologies). For immunohis- 
tochemistry using antibodies against PCMI1, antigen retrieval was first 
performed by incubation with 1% SDS for 5 min at room temperature 
with gentle rotation. PCM1 antibody was used to specifically identify 
cardiomyocytes in heart tissue sections. Histological cardiac sections 
were washed thoroughly in 1x PBS before proceeding. For immunohis- 
tochemistry using the collagen hybridizing peptide (CHP) (3Helix, no. 
BIO300), a stock solution of 15 pm biotin-conjugated CHP was prepared 
according to the manufacturer’s instructions. The solution was heated 
to 80 °C for 5 min to denature the peptide, as previously described”, 
followed by rapid cooling onice and incubation on tissue sections over- 
night at 4 °C. Sections were then washed and processed for secondary 
antibody staining with fluorophore-conjugated streptavidin antibody 
used at a 1:200 dilution for visualization (Life Technologies). Confo- 
cal microscopy and image acquisition were performed using a Nikon 
Eclipse Ti inverted microscope equipped with a Nikon A1R confocal 
running NIS Elements AR 4.50. 


Passive force measurements 

Tissue strips from the infarct region of the left ventricle (or the left 
ventricular free wall for uninjured hearts) were dissected using a 
Zeiss Discovery V8 dissection microscope. Tissues were cut into 3mm 
(length) x 2-mm (width) strips, and 3 or 4 strips were cut from each 
infarct region of the left ventricle. These strips contained scar and a 
small region of border zone oneachend. Tissue strips were maintained 
in M199 media (Corning Cellgro, 10-060-CV) with no supplementa- 
tion throughout the procedure of force measurements. Tissue strips 
were attached to aluminium t-clips (Kem-Mil, no. 1870) and mounted 
onto a permeabilized muscle fibre test apparatus (Aurora Scientific, 
Model: 802D-160-322) initially set to zero tension. Cardiac tissue length 
was then increased 5% over 50 ms, held for 450 ms and then stretched 
again from 5% to 50% in intervals of 5% with no period of relaxation, and 
force was monitored using DMC v600A software (Aurora Scientific). 
Change in force was calculated as the difference between maximum 
force generated after the 50-ms pull and the minimum force achieved 
after each time period. Minimum force was calculated when the rate of 
force decay was zero by solving for the derivative of the best-fit trend 
line, which was a second-degree polynomial equation. 
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RT-PCR from isolated infarct tissues 

Tissue strips isolated from the infarct region as described in 
‘Passive force measurements’ were homogenized with a Precellys 
24 homogenizer (Bertin Instruments no. 03119.200.RDOOO) and 
RNA was isolated by using the RNeasy fibrous tissue kit according 
to the manufacturer’s instructions (Qiagen no. 74704). One micro- 
gram of total RNA was reverse-transcribed using random oligo-dT 
primers and a Verso cDNA synthesis kit (Thermo Fisher Scientific no. 
AB1453A) according to the manufacturer’s instructions. Real-time PCR 
was performed using Sso Advanced SYBR Green (BioRad no. 1725274) 
according to the following programme: one cycle of 95 °C for 10 min, 
one cycle of 95 °C for 15s, 40 cycles of 95 °C for 15 s, 57 °C for 10 s and 
62°C for 30s, and one cycle of 62 °C for 30 s. Gapdh expression was 
used for normalization. Primer sequences used are included in Sup- 
plementary Table 2. 


Flow cytometry and cell sorting 

For analysis of surface markers on MNCs or CPCs, cells were resus- 
pended in 1x HBSS supplemented with 2% BGS and 2 mM EDTA and 
incubated with fluorophore- or biotin-conjugated primary antibod- 
ies (Supplementary Table 1) for 20 min at 4 °C with gentle rotation. 
Cells were then washed twice with 1x HBSS + 2% BGS and 2 mM EDTA. 
For detection of biotinylated antibodies, cells were incubated with 
streptavidin-conjugated BV421 (BD Horizon no. 563259) for 15 min 
at 4 °C with gentle rotation and then washed twice with 1x HBSS + 2% 
BGS and 2 mM EDTA. Samples were analysed using a BD FACSCanto 
running BD FACSDiVa V.8.0 software (BD Biosciences) and using the 
following laser configuration: blue (488 nm), yellow-green (561 nm) 
and red (635 nm). Analysis and quantification were performed using 
FlowJo v.10 (Tree Star). 

For flow cytometry analysis of whole heart cardiac immune cell 
content, single-cell suspensions were first prepared using enzymatic 
dissociation and trituration as described in ‘Preparation of cell or 
inflammatory therapies’. Mice were anaesthetized by 2% isoflurane 
inhalation to effect and killed by cervical dislocation. Hearts were rap- 
idly excised and briefly rinsed in cold cardioplegic solution (1M KCI 
in 1x PBS) before enzymatic dissociation. The pellet resulting from 
dissociation was resuspended in 1 ml of red blood cell lysis buffer 
(150 mM NH,CI, 10 mM KHCO, and 0.1 mM Na, EDTA) and incubated at 
room temperature for 5 min. Samples were then centrifuged at 400g 
for 10 min at 4 °C and resuspended in 1x HBSS supplemented with 2% 
BGS and 2 mMEDTA. Cells were incubated with fluorophore-conjugated 
primary antibodies for 20 min at 4 °C with gentle rotation, washed twice 
with 1x HBSS and analysed using a BD LSRFortessa running BD FACS- 
DiVa v.8.0 software (BD Biosciences) and using the following laser con- 
figuration: UV (355 nm), violet (405 nm), blue (488 nm), yellow-green 
(561nm) and red (635 nm) to detect fluorophore-conjugated antibod- 
ies and/or endogenous RFP and GFP signal from Ccr2-RFP x Cx3cr1-GFP 
mice. Analysis and quantification were performed using FlowJo v.10 
(Tree Star.). 

For isolation of cardiac CCR2* or CX3CRI* macrophages by fluo- 
rescence-activated cell sorting, hearts from Ccr2-RFP x Cx3cr1-GFP 
mice at 7 days after I-R injury were isolated and dissociated to single- 
cell suspensions as described in ‘Preparation of cell or inflammatory 
therapies’ for isolation of CPCs, except that the digestion solution was 
made in DMEM + 2% BGS and 1% penicillin-streptomycin instead of 
HBSS. Isolated cells were sorted by fluorescence-activated cell sorting 
using a Sony SH800S benchtop cell sorter in a BSL-2 biosafety cabinet. 
Endogenous RFP and GFP fluorescence were detected using 4 collinear 
lasers and CCR2* (RFP*GFP_) or CX3CRI* (GFP*RFP* or RFP) cells were 
sorted into 1.5-ml Eppendorf tubes containing DMEM + 10% BGS and 
1% penicillin-streptomycin. Cells were then cultured on isolated car- 
diac fibroblasts as described in ‘Cardiac fibroblast and macrophage 
co-culture’. 


Cardiac fibroblast isolation 

Hearts were excised from ten eight-week-old male and female C57BI/6J 
mice, and the ventricles and septum were isolated, rinsed in ice-cold 
PBS and minced into small pieces using sterile microscissors. Tissue 
fragments were digested in DMEM + 2% BGS and 1% penicillin-strepto- 
mycin containing type 2 collagenase (100 units per millilitre; LS004177, 
Worthington) for 20 min at 37 °C under gentle agitation. The digested 
tissue was triturated repeatedly to promote tissue dissociation. Dense 
fragments settled for 2 min and the supernatant, containing the cardiac 
fibroblasts, was collected and spun at 100g for 5 min. The cell pellet 
was resuspended in10 ml of DMEM + 10% BGS and 1% penicillin-strep- 
tomycin and kept on ice. This process was repeated three times, until 
all the tissue was adequately digested. To remove cardiomyocytes 
and cell debris, cell suspensions were spun at 30g, followed by cen- 
trifugation of the supernatant at 100g. The final cell pellet containing 
the cardiac fibroblasts was resuspended in DMEM + 10% BGS and 1% 
penicillin-streptomycin and pre-incubated on 0.1% gelatin-coated 
plates for 2 h to allow fibroblast adherence before replenishment of 
the cell culture medium. 


Cardiac fibroblast and macrophage co-culture 

Isolated cardiac fibroblasts were split into 24-well 0.1% gelatin-coated 
plates at aseeding density of 15,000 cells per well and allowed to adhere 
overnight. Macrophage subtypes (CCR2* and CX3CRI' cells) isolated 
as described in ‘Flow cytometry and cell sorting’ were then seeded 
onto these cardiac fibroblasts at a density of 10,000 macrophages per 
15,000 fibroblasts. Control fibroblasts received an equivalent amount 
of culture medium containing no macrophages. Adherence was veri- 
fied the following day by fluorescence microscopy for RFP or GFP. Cells 
were isolated 72 h later for mRNA quantification. 


mRNA isolation and qRT-PCR from cultured cardiac fibroblasts 
Total RNA was purified from cultured cells with TRIzol reagent (Fisher 
Scientific no. 15596018) according to the manufacturer’s instructions. 
Two hundred nanograms of RNA was reverse-transcribed to cDNA using 
the Verso cDNA synthesis kit (Thermo Fisher Scientific no. 277.97). 
Quantitative PCR was performed using SsoAdvanced Universal SYBR 
Green Supermix (BioRad no. 1725274) and assayed in duplicate, accord- 
ing to the manufacturer’s instructions ina CFX96 PCR system. Primer 
sequences are included in Supplementary Table 2. All data were normal- 
ized to Gapdh (verified to not deviate between samples). 


Macrophage culture on fibrillar collagen patches 

Pre-sterilized resorbable collagen membranes (Ace Surgical Supply no. 
509-3040) were cut into circles of a uniform thickness and a diameter 
of 6mm, and then placed into 96-well plates. Bone marrow or perito- 
neal macrophages were isolated as previously described”? and cul- 
tured in DMEM + 10% BGS and 1% penicillin—streptomycin. Cells were 
then seeded onto the collagen patches at a density of 10,000 cells per 
patch. Control patches were incubated in culture medium without 
macrophages. Five biological replicates were performed per group. 
After five days in culture, fibrillar collagen assembly was analysed by 
second harmonic generation microscopy using a Nikon A1R multipho- 
ton upright confocal microscope equipped with a tuneable Coherent 
Chameleon II TiSapphire IR laser set to 840 nm. Three images were 
randomly taken per patch and assessed by a blinded investigator to 
select the representative images for each group. 


Statistical information and experimental rigour (blinding) 

All statistical tests used and graphical depictions of data (means and 
error bars, or box and whisker plots) are defined within the figure leg- 
ends for the respective data panels. Exact n values for all experiments 
with statistical analysis are included in the figure legends or within 
the figure itself. For comparisons between two groups, unpaired or 


Student’s two-tailed t-tests were performed as noted within figure 
legends. For comparisons between more than two groups, a one-way 
ANOVA with Tukey’s or Dunnett’s post hoc test was performed as noted 
within figure legends. P< 0.05 was considered as statistically significant. 
Data in Fig. 4i used a Kruskal-Wallis test with Dunn’s multiple compari- 
sons test, as these data did not follow a normal distribution. All other 
data were found to follow a normal distribution as determined by the 
Shapiro-Wilk normality test or Kolmogorov—-Smirnov test (a=0.05). For 
experiments involving I-R surgery, the number of mice that received 
surgery was determined based on previous experimentation in the 
laboratory, which demonstrated a peri-operative surgical mortality 
rate of 20%. Only mice that did not survive a given surgical procedure, 
or which were found at the time of I-R surgery to have incomplete 
reperfusion (failure of slipknot suture release), were excluded from 
analysis; otherwise no exclusions occurred. Randomization of mice 
within a group to receive a given surgical procedure (I-R versus sham) 
or treatment (saline versus cells or zymosan) was not needed because 
the mice were genetically identical and were littermates, although 
equal sex ratios and age ranges were maintained. Echocardiographic 
analysis, quantification of eGFP* endothelial cells, quantification of 
cardiomyocytes in cell cycle, measures of fibrosis, measures of tissue 
passive force, in vivo and in vitro gene expression analysis, and analysis 
of collagen organization in culture were conducted by investigators 
blinded to experimental treatment or procedure. Quantification of 
macrophage content by immunohistochemistry was performed using 
automated fluorescence threshold analysis in NIS Elements 4.50. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All raw data generated or analysed in this study are available from the 
corresponding author upon reasonable request. Original source data 
used to generate graphs in each of the figures and Extended Data figures 
are available as Microsoft Excel data sheet files from the correspond- 
ing author. 
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Extended Data Fig. 1| Characterization of cells used in the injection studies 
and initial neutrophil response to injections. a, Flow cytometry analysis of 
MNCs isolated from Rosa26-mTomato mice for intracardiac injection. Singlet 
cells were selected by forward and side scatter properties followed by 
mTomato-positivity. mTomato’ cells were then assessed for surface expression 
of CD3e, CD11b, Ly6c, CD45R, Ly6G or Ter119 via antibodies. b, Flowcytometry 
plots for CPCs immunolabelled with antibodies against mesenchymal (Scal), 
endothelial (CD31) or haematopoietic (CD45) lineage markers as indicatedin 
the plots. Anunstained negative control (unst.) plot is also shown. Gating was 
determined versus unstained negative controls. Similar results ina, b were 
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obtained from atleast three independent cell preparations. c, Quantification 
via flow cytometry of total neutrophil levels in dissociated hearts from MNC-, 
zymosan.- or saline-injected male and female C57BI/6) mice, three days after 
injection. Asacomparison, data from n=5 C57BI/6J mice isolated one day after 
I-Rinjury are also shown when neutrophil levels are high. Data are fromn=4 
(MNC and zymosan) or n=2 (saline) mice. Numerical data are summarized as 
box-and-whisker plots, indicating the median value (black bar inside box), 25th 
and 75th percentiles (bottom and top of box, respectively), and minimum and 
maximum values (bottom and top whisker, respectively). 
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Extended Data Fig. 2| Basal cardiac structure and function after cell or 
zymosan injection in uninjured mice. a, Schematic of all experiments 
performed in this figure, in which C57BI/6J eight-week-old male and female 
mice received intracardiac injection of mTomato-labelled MNCs, 

Alexa 594-labelled zymosan or sterile saline and were assessed by 
echocardiography after two weeks. b-f, Echocardiography-measured 
fractional shortening (FS) percentage (b), heart rate (HR) as beats per minute 
(bpm) under isoflurane anaesthesia (c), left ventricular posterior wall thickness 
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in diastole (LVPWT;d) in millimetres (d), left ventricular end-systolic volume 
(LVESV) in microlitres (e) and left ventricular end-diastolic volume (LVEDV) in 
microlitres (f). All values in b-f were unchanged with injection of MNCs or 
zymosan versus Saline. All numerical data are summarized as box-and-whisker 
plots, indicating the median value (black bar inside box), 25th and 75th 
percentiles (bottom and top of box, respectively), and minimum and maximum 
values (bottom and top whisker, respectively). The number of mice for each 
group inb-fis indicated below each respective box-and-whisker plot. 
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Extended Data Fig. 3 | Characterization of inflammatory and functional 
effects of the intracardiac injection protocol. a, Schematic of experiments 
performed inb-e. Eight-week-old C57BI/6J (uninjured) male and female mice 
received intracardiac injection of sterile saline, or asham procedure in which 
the heart was exposed by thoracotomy but no intracardiac injections were 
done. Naive mice without surgery served as an additional control. 

b-e, Quantification via flow cytometry of immune cells from enzymatically 
dissociated hearts from the above groups of mice. Total CD11b* myeloid cells 
(b), CD11b*Ly6C"*" (c) or CD11b*Ly6C'™ (d) monocytes, and CD11b*F4/80* 
macrophages (e) from n=4 mice per group are shown, normalized tothe 
starting weight of dissociated tissue. *P< 0.05 versus naive and *P<0.05 versus 
sham by one-way ANOVA with Tukey’s post hoc test (exact Pvalues are shownin 
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the panel). f, Schematic of experiments performed in g-iin which eight-week- 
old male and female C57BI/6) mice received I-R injury followed by either 
intracardiac injection of saline or thoracotomy (sham) after one week, and then 
were assessed by echocardiography two weeks later. g-i, Fractional shortening 
(g), left ventricular end-systolic volume (h) and heart rate under isoflurane 
anaesthesia (i), as measured by echocardiography in the groups indicated. The 
number of mice for each group is indicated in each graph ing-i. All numerical 
data inthis figure are summarized as box-and-whisker plots, indicating the 
median value (black bar inside box), 25th and 75th percentiles (bottom and top 
of box, respectively), and minimum and maximum values (bottom and top 
whisker, respectively). 
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Extended Data Fig. 4 | Additional echocardiographic parameters and effect 
of treatment with MNCs killed by freezing and thawing, after I-Rinjury. 
a-c, Echocardiography to measure left ventricular end-systolic volume in 
microlitres (a), left ventricular end-diastolic volume in microlitres (b) or heart 
rate under isoflurane anaesthesia (c) in mice that received an intracardiac 
injection of MNCs, CPCs, zymosan orsterile saline, three weeks after I-R. These 
data were collected in the same group of mice shown in Fig. 3b, d. Datainaare 
significantly different, as assessed by one-way ANOVA with Dunnett’s post hoc 
test (exact Pvalues are shown in the panels). d, Schematic of experiment in 
which mice received an intracardiac injection of mfomato* MNCs or MNCs 
killed by freezing and thawing one week after I-R, and then two weeks later the 
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cardiac ventricular fractional shortening percentage was measured by 
echocardiography. Exact P values are shown in the panel versus I-Rand saline, 
which involved one-way ANOVA with Dunnett’s post hoc test to examine 
significance. The sham, I-Rand saline, andI-R and MNC groups shown hereind 
are the same dataas also shown in Fig. 3, as these studies were performed in 
parallel. Allnumerical data are summarized as box-and-whisker plots, 
indicating the median value (black bar inside box), 25th and 75th percentiles 
(bottom and top of box, respectively), and minimum and maximum values 
(bottom and top whisker, respectively). The number of mice for each group is 
indicated belowthe respective plot. 
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Extended Data Fig. 5 | Characterization of cell types labelled from Ccr2- 
RFPx Cx3cr1-GFP knock-in mice. a, Schematic of experiments in which hearts 
from male and female double-heterozygous Ccr2-RFP x Cx3cr1-GFP knock-in 
mice (n=3) were isolated and analysed by flowcytometry at 4 months of age. 
b, Representative flow cytometry plots with the gating strategy shown, in 
which singlet cells were first selected by forward and side scatter properties 
followed by gating on endogenous GFP and RFP fluorescence for 
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CX3CRI'CCR2 (GFP*RFP ) or CX3CR1*CCR2* (GFP*RFP*) cells. c-f, Cells within 
the GFP*RFP or GFP*RFP* gates as shown in b were then assessed for 

surface marker expression via antibodies. CD64*CD169* macrophages (c), 
CD11b*CD11c* dendritic cells (d) or CD11b*Ly6G* neutrophils (e) are shownasa 
percentage ofall GFP*RFP’ or GFP*RFP* cells. f, Ly6c positivity in the total 


GFP*RFP” or GFP*RFP* populations is also shown. All data are represented as 
the mean+s.e.m. fromn=3 mice. 
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Extended Data Fig. 6 | Response of Cx3cr1 global gene-deleted mice to 
myocardial infarction injury. a, Schematic of experiments in which eight- 
week-old male and female Cx3cr1-GFP heterozygous (het.) or Cx3cr1-GFP/GFP 
homozygous (knockout, KO) mice received myocardial infarction (MI) via 
permanent occlusion of the left coronary artery and were then followed out for 
12 weeks. b,c, Representative confocal immunohistochemistry images from 
hearts of mice described ina at one day, three days or four weeks after 
myocardial infarction, showing endogenous GFP fluorescence from the Cx3cr1 
knock-in allele. Immunohistochemistry for activated CD68 macrophages 
(white) is also shown inb. Micrographs are representative of n=3 mice per time 
point. Scale bars, 100 pm. d, Quantification via flow cytometry of total 


1d post-MI; infarct border zone 


+CD68 ame : 
3d post-MI; whole heart 


[] Cer2RFP/+ Cx3cr1SFP* (Het.) 
[J Cer2*/* Cx3cr1SFP/GFP (KO) 


de eT 


CD11b*t Ly6G* 
neutrophils macrophages 


t border Z 


P+CD68 Zh 


— Control (n=26) 
— Cx3cr1 KO (n=29) 


°5 10 20 30 40 50 60 70 80 90 


CD64* 


monocyte, neutrophil and macrophage levels in dissociated whole hearts from 
Cx3cr1 heterozygous or knockout mice at three days after myocardial 
infarction. Data are from n=11 (heterozygous) or n=12 (knockout) mice. 
Numerical data are summarized as box-and-whisker plots, indicating the 
median value (black bar inside box), 25th and 75th percentiles (bottom and top 
of box, respectively), and minimum and maximum values (bottom and top 
whisker, respectively). e, Survival curve for Cx3cr1-GFP/GFP mice versus 
controls over 12 weeks (x axis shows days) after myocardial infarction injury. 
Control mice in this experiment included both Cx3crl-GFP heterozygous and 
C57BI/6J animals. The number (n) of mice for each group is indicated inthe 
figure. *P= 0.0473 by two-sided Gehan-Breslow-Wilcoxon test. 
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Extended Data Fig. 7 | Mechanical and structural improvements in cell- 
therapy- or zymosan-treated hearts after I-R. a, Change in passive force 
generation over increasing stretch lengthening (per cent of Ly) inisolated 
infarct strips from zymosan- or saline-injected hearts, analysed three weeks 
after surgical injury (injection of zymosan or saline occurred two weeks before 
collecting from mice). Exact Pvalues are shown in the panel, which were 
determined by unpaired two-tailed t-test. The I-R and saline, and untreated 
control, data shown here are the sameas in Fig. 4, as these studies were 
performed in parallel. Numerical data are summarized as box-and-whisker 
plots, indicating the median value (black bar inside box), 25th and 75th 
percentiles (bottom and top of box, respectively), and minimum and maximum 


values (bottom and top whisker, respectively), from the number (n) of mice 
indicated in the panels. b, Representative confocal micrographs from heart 
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histological sections from Ccr2-RFP x Cx3cr1-GFP mice at ten days after I-R, 
showing the infarct border zone versus remote myocardium, and labelled with 
a biotin-conjugated CHP that detects immature or denatured collagen. CHP 
labelling was detected witha streptavidin-conjugated Alexa 647 secondary 
antibody (purple). c, Representative confocal micrographs of heart 
histological sections from the post-I-R border zone of Ccr2-RFP x Cx3cr1-GFP 
mice that received intracardiac injection of MNCs or zymosan at seven days 
after I-R, analysed after an additional three days. Endogenous RFP (red) or GFP 
(green) fluorescence shows CCR2* or CX3CRI1* macrophages, respectively. 
Sections were treated with CHP (purple) asin b to visualize immature collagen 
and areas of active remodelling versus areas in which subtypes of macrophage 
differentially localize. Images in b, care representative of n=2 mice per group. 
Scale bars, 100 pm. 
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Life Sciences Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form is intended for publication with all accepted life 
science papers and provides structure for consistency and transparency in reporting. Every life science submission will use this form; some list 
items might not apply to an individual manuscript, but all fields must be completed for clarity. 


For further information on the points included in this form, see Reporting Life Sciences Research. For further information on Nature Research 
policies, including our data availability policy, see Authors & Referees and the Editorial Policy Checklist. 


Please do not complete any field with "not applicable" or n/a. Refer to the help text for what text to use if an item is not relevant to your study. 
For final submission: please carefully check your responses for accuracy; you will not be able to make changes /ater. 


u Experimental design 
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Describe how sample size was determined. Sample sizes were chosen based on extensive prior experience of the lab in characterizing 
cardiac injury models in the mouse, including observed post-operative surgical mortality rates 
for myocardial infarction and ischemia/reperfusion injury. 


2. Data exclusions 


Describe any data exclusions. Data exclusions were only made in the case of animals that either did not survive a given 
surgical procedure (intra-cardiac injection and/or myocardial infarction with reperfusion; |/R) 
or were insufficiently reperfused at the time of surgery (failure of surgical slipknot release). In 
these cases, any previously recorded data from these animals (for example at timepoints 
prior to death), were not excluded. 


3. Replication 


Describe the measures taken to verify the reproducibility | The results of all in vivo experiments were reproducible as shown across multiple animals 

of the experimental findings. (exact n values indicated in the text and figures) over multiple surgical cohorts. Cohorts of 
mice for intra-cardiac injection of cells/zymosan included age, strain, and sex-matched 
controls (saline injection) for comparison. 


4. Randomization 


Describe how samples/organisms/participants were All experimental groups were allocated to provide a roughly equal mix of males and females. 
Randomization of mice within a group to receive a given surgical procedure (I/R or MI vs 
sham) or treatment (saline vs cells vs zymosan) was not needed because the mice within a 
given genotype or treatment group were genetically identical and were littermates. 


allocated into experimental groups. 


5. Blinding 


Describe whether the investigators were blinded to group. Full blinding during intra-cardiac injection and/or I/R injury and collection of 

allocation during data collection and/or analysis. echocardiographic data was not performed due to logistical issues of animal handling and 
post-operative care. Analysis of echocardiographic data, all histological analyses, gene 
expression analysis, and analysis of tissue passive force properties were performed by 
investigators blinded to experimental treatment or procedure. 


Note: all in vivo studies must report how sample size was determined and whether blinding and randomization were used. 
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6. Statistical parameters 


For all figures and tables that use statistical methods, confirm that the following items are present in relevant figure legends (or in the 
Methods section if additional space is needed). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement (animals, litters, 
cultures, etc.) 


A description of how samples were collected, noting whether measurements were taken from distinct samples or whether the same 
sample was measured repeatedly 


x| A statement indicating how many times each experiment was replicated 


The statistical test(s) used and whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


x| A description of any assumptions or corrections, such as an adjustment for multiple comparisons 


Test values indicating whether an effect is present 
Provide confidence intervals or give results of significance tests (e.g. P values) as exact values whenever appropriate and with effect sizes noted. 


x| Aclear description of statistics including central tendency (e.g. median, mean) and variation (e.g. standard deviation, interquartile range) 
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x| Clearly defined error bars in all relevant figure captions (with explicit mention of central tendency and variation) 


See the web collection on statistics for biologists for further resources and guidance. 


ui Software 


Policy information about availability of computer code 


7. Software 
Describe the software used to analyze the data in this Quantitation of immunohistochemistry images was performed using NIS Elements v4.50 
study. (Nikon Instruments). Quantitation of histological images of fibrosis was performed using 


Adobe Photoshop CSS. Analysis of echocardiographic data was performed using VevoLAB 
v1.7.1. (VisualSonics). Analysis of analytical flow cytometry data was performed using FlowJo 
V.10 (Tree Star, Inc). Acquisition of passive tissue force measurements was performed using 
DMC v600A (Aurora Scientific). For statistical analysis, two-tailed t-tests (Figures 4d, 4e) were 
performed using Microsoft Excel, Office365 version. All other tests, including one-way 
analysis of variance (ANOVA), survival curve analysis, and normality tests, were performed 
using GraphPad Prism version 8.0.1. 


For manuscripts utilizing custom algorithms or software that are central to the paper but not yet described in the published literature, software must be made 
available to editors and reviewers upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). Nature Methods guidance for 
providing algorithms and software for publication provides further information on this topic. 


ui Materials and reagents 


Policy information about availability of materials 


8. Materials availability 


Indicate whether there are restrictions on availability of Kit-MerCreMer x R-GFP mice used for lineage tracing will be made available under a materials 

unique materials or if these materials are only available transfer agreement (MTA) between the requesting institution and Cincinnati Children's 

for distribution by a third party. Hospital. All other materials and reagents were obtained from commercial sources (see 
Online Methods for full description). 
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Antibodies 


Describe the antibodies used and how they were 
validated for use in the system under study (i.e. assay and 
species). 


All antibodies used in this study are commercially available. Antibody validation was 
performed by the individual manufacturer; validation information can be found on the 
manufacturers’ web pages via the links provided below. Please also see Supplementary Table 
1 and Online Methods for further description of antibodies, including dilutions used. Staining 
of samples in this study was also compared to unstained or secondary antibody-only controls 
processed in the same way. 


Individual validation statements: 


Abcam: https://www.abcam.com/primary-antibodies/improving-reproducibility-with-better- 
antibodies 


Thermo Fisher: https://www.thermofisher.com/us/en/home/life-science/antibodies/ 
invitrogen-antibody-validation.html 


Novus: https://www.novusbio.com/5-pillars-validation 
BioLegend: https://www.biolegend.com/reproducibility 


eBioscience: https://www.thermofisher.com/us/en/home/life-science/antibodies/invitrogen- 
antibody-validation.html 


Assay: Immunohistochemistry 


CD68 - Figures 1b, 1c, 3i, Extended Data Figure 6b. 
Supplier: Abcam, #ab53444 

onoclonal, clone FA-11 
Lot: GR3241798-1 


anufacturer’s validation information: https://www.abcam.com/cd68-antibody-fa-11- 
ab53444. pdf 


Dilution used here: 1:100 


Detected with: 

Goat anti-rat IgG Alexa Fluor 488 (Figures 1b, 1c) 
Supplier: Life Technologies, #A-11006 

Lot: 1887148 


anufacturer’s validation information: https://assets.thermofisher.com/TFS-Assets/LSG/ 
certificate/Certificates-of-Analysis/1887148_A11006.pdf 


Dilution used here: 1:200 


or 


Goat anti-rat IgG Alexa Fluor 647 (Figure 3i, Extended Data Figure 6b) 
Supplier: Life Technologies, #A-21247 
Lot: 2043368 
anufacturer’s validation information: https://assets.thermofisher.com/TFS-Assets/BID/ 
certificate/Certificates-of-Analysis/A21247%20Lot%202043368%20CofA.pdf 
Dilution used here: 1:200 


i-67 - Figures 2b, 2c. 

Supplier: Thermo Fisher, #14-5698-82 
onoclonal, clone SolA15 

Lot: 4328926 


anufacturer’s validation information: https://www.thermofisher.com/order/genome- 
database/dataSheetPdf? 
producttype=antibody&productsubtype=antibody_primary&productld=14-5698-82&version= 
65 


Dilution used here: 1:100 


Detected with: 

Goat anti-rat IgG Alexa Fluor 647 
Supplier: Life Technologies, #A-21247 
Lot: 2043368 


anufacturer’s validation information: https://assets.thermofisher.com/TFS-Assets/BID/ 
certificate/Certificates-of-Analysis/A21247%20Lot%202043368%20CofA.pdf 
Dilution used here: 1:200 


PCM1- Figures 2b, 2c. 

Supplier: Novus Biologicals, #NBP1-87196 

Polyclonal 

Lot: F117419 

anufacturer’s validation information: https://www.novusbio.com/PDFs/NBP1-87196.pdf 
Dilution used here: 1:100 


Detected with: 
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Life Sciences Reporting Summary (Antibodies section did not have enough room) 
9. Antibodies 

Describe the antibodies used and how they were validated 

for use in the system under study (i.e. assay and species). 


All antibodies used in this study are commercially available. Antibody validation was performed by the 
individual manufacturer; validation information can be found on the manufacturers’ web pages via the 
links provided below. Please also see Supplementary Table 1 and Online Methods for further description 
of antibodies, including dilutions used. Staining of samples in this study was also compared to unstained 
or secondary antibody-only controls processed in the same way. 


Individual validation statements: 

Abcam: https://www.abcam.com/primary-antibodies/improving-reproducibility-with-better-antibodies 
Thermo Fisher: https://www.thermofisher.com/us/en/home/life-science/antibodies/invitrogen- 
antibody-validation.html 

Novus: https://www.novusbio.com/5-pillars-validation 

BioLegend: https://www.biolegend.com/reproducibility 

eBioscience: https://www.thermofisher.com/us/en/home/life-science/antibodies/invitrogen-antibody- 
validation.html 


Assay: Immunohistochemistry 


CD68 - Figures 1b, 1c, 3i, Extended Data Figure 6b. 
Supplier: Abcam, #ab53444 

Monoclonal, clone FA-11 

Lot: GR3241798-1 


Manufacturer’s validation information: https://www.abcam.com/cd68-antibody-fa-11-ab53444. pdf 
Dilution used here: 1:100 


Detected with: 

Goat anti-rat IgG Alexa Fluor 488 (Figures 1b, 1c) 

Supplier: Life Technologies, #A-11006 

Lot: 1887148 

Manufacturer’s validation information: https://assets.thermofisher.com/TFS- 
Assets/LSG/certificate/Certificates-of-Analysis/1887148 A11006.pdf 
Dilution used here: 1:200 


or 


Goat anti-rat IgG Alexa Fluor 647 (Figure 3i, Extended Data Figure 6b) 

Supplier: Life Technologies, #A-21247 

Lot: 2043368 

Manufacturer’s validation information: https://assets.thermofisher.com/TFS- 
Assets/BID/certificate/Certificates-of-Analysis/A21247%20Lot%202043368%20CofA. pdf 
Dilution used here: 1:200 


Ki-67 - Figures 2b, 2c. 


Supplier: Thermo Fisher, #14-5698-82 

Monoclonal, clone SolA15 

Lot: 4328926 

Manufacturer’s validation information: https://www.thermofisher.com/order/genome- 
database/dataSheetPdf?producttype=antibody&productsubtype=antibody primary&productid=14- 
5698-82&version=65 

Dilution used here: 1:100 


Detected with: 

Goat anti-rat IgG Alexa Fluor 647 

Supplier: Life Technologies, #A-21247 

Lot: 2043368 

Manufacturer’s validation information: https://assets.thermofisher.com/TFS- 
Assets/BID/certificate/Certificates-of-Analysis/A21247%20Lot%202043368%20CofA. pdf 
Dilution used here: 1:200 


PCM1- Figures 2b, 2c. 

Supplier: Novus Biologicals, ¥#NBP1-87196 
Polyclonal 

Lot: F117419 


Manufacturer’s validation information: https://www.novusbio.com/PDFs/NBP1-87196.pdf 
Dilution used here: 1:100 


Detected with: 

Goat anti-rabbit IgG Alexa Fluor 488 

Supplier: Life Technologies, #A-11008 

Lot: 1853312 

Manufacturer’s validation information: https://assets.thermofisher.com/TFS- 
Assets/LSG/certificate/Certificates%200f%20Analysis/1853312 A11008.pdf 
Dilution used here: 1:200 


CD31 — Figures 2e, 2f. 

Supplier: BD Biosciences, #553370 

Monoclonal, clone MEC 13.3 

Lot: 6196687 

Manufacturer’s validation information: http://www.bdbiosciences.com/ds/pm/tds/553370.pdf 
Dilution used here: 1:100 


Detected with: 

Goat anti-rat IgG Alexa Fluor 647 

Supplier: Life Technologies, #A-21247 

Lot: 2043368 

Manufacturer’s validation information: https://assets.thermofisher.com/TFS- 
Assets/BID/certificate/Certificates-of-Analysis/A21247%20Lot%202043368%20CofA. pdf 
Dilution used here: 1:200 


Collagen Hybridizing Peptide, Biotin Conjugate (B-CHP) - Extended Data Figures 7b, 7c 
Supplier: 3Helix #BIO300 


Biotin-conjugated peptide 
Manufacturer’s validation information: https://www.3helix.com/wp- 
content/uploads/2019/04/3Helix CHP user guide 20190326.pdf 


Dilution used here: 15uM per tissue section 


Detected with: 

Alexa Fluor 647 Streptavidin 

Supplier: Life Technologies, #S-21374 

Lot: 1990312 

Manufacturer’s validation information: https://www.thermofisher.com/document-connect/document- 
connect.html?url=https%3A%2F%2Fassets.thermofisher.com%2FTFS- 
Assets%2FBID%2Fcertificate%2FCertificates-of- 


Analysis%2FS21374%2520L0t%25201990312%2520CofA. pdf&title=MTkSMDMxMg== 
Dilution used here: 1:200 


Assay: Flow Cytometry 


CD3e, CD11b, CD45R, Ly-6G, or Ter119 (eBioscience #88-7774-75, as a biotinylated panel). Extended 
Data Figure 1a. 
CD3e: Lot # E02344-1633 
CD11b: Lot # 4283483 
CD45R: Lot # 4291976 
Ly6G: Lot # E03072-1632 
Ter199: Lot # 4283598 
Supplier: eBioscience, #88-7774-75 
Manufacturer’s validation information: https://assets.thermofisher.com/TFS-Assets/LSG/manuals/88- 


7774.pdf 
Dilution used here: 1:50 for CD3e, 1:100 for each of the other antibodies 


Detected with: 

Streptavidin-conjugated Brilliant Violet 421 

Supplier: BD Biosciences, #563259 

Lot: 6077614 

Manufacturer’s validation information: http://www.bdbiosciences.com/ds/pm/tds/563259.pdf 
Dilution used here: 1:200 


Brilliant Violet 421™ anti-mouse CD11b - Extended Data Figure 3b-e. 

Supplier: BioLlegend, #101251 

Monoclonal 

Lot: B260574 

Manufacturer’s validation information: https://www.biolegend.com/en-us/products/brilliant-violet-421- 
anti-mouse-human-cd11b-antibody-7163 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore. 


Brilliant Violet 510™ anti-mouse Ly6c - Extended Data Figure 3c-d 
Supplier: BioLlegend #128033 
Monoclonal 


Lot: B251849 

Manufacturer’s validation information: https://www.biolegend.com/en-us/products/brilliant-violet-510- 
anti-mouse-ly-6c-antibody-8726 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


Brilliant Violet 421™ anti-mouse CD64 (FcyRI) - Figure 1f, Extended Data Figure 5c, Extended Data Figure 
6d. 

Manufacturer: BioLegend, #139309 

Monoclonal 

Lot: B254355 

Manufacturer’s validation information: https://www.biolegend.com/en-us/products/brilliant-violet-421- 
anti-mouse-cd64-fcgammari-antibody-8992 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


APC anti-mouse F4/80 [BMB8] - Figure 1f, Extended Data Figure 3e. 

Manufacturer: BioLegend, #123116 

Monoclonal 

Lot: B268075 

Manufacturer’s validation information: https://www.biolegend.com/en-us/products/apc-anti-mouse-f4- 
80-antibody-4071 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


eFluor660 CD169 (Siglec-1) - Extended Data Figure 5c. 

Manufacturer: Thermo Fisher Scientific, #50-5755-82 

Monoclonal 

Clone: SER-4 

Lot: 1917798 

Manufacturer’s validation information: https://www.thermofisher.com/order/genome- 
database/dataSheetPdf?producttype=antibody&productsubtype=antibody primary&productld=50- 
5755-82&version=65 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


APC anti-mouse CD11b - Extended Data Figure 5d-e, Extended Data Figure 6d. 

Manufacturer: eBioscience, #17-0112-82 

Monoclonal 

Clone: M1/70 

Lot: 4339583 

Manufacturer’s validation information: https://www.thermofisher.com/order/genome- 
database/dataSheetPdf?producttype=antibody&productsubtype=antibody primary&productld=17- 
0112-82&version=65 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


BUV395 Hamster Anti-Mouse CD11c - Extended Data Figure 5d. 


Manufacturer: BD Biosciences, #564080 

Monoclonal 

Clone: HL3 

Lot: 8206832 

Manufacturer’s validation information: http://www.bdbiosciences.com/ds/pm/tds/564080.pdf 
Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


Brilliant Violet 421™ anti-mouse LY-6G - Extended Data Figure 5e, Extended Data Figure 6d. 
Manufacturer: BD Biosciences, #562737 

Monoclonal 

Clone: 1A8 

Lot: 7270577 

Manufacturer’s validation information: http://www.bdbiosciences.com/ds/pm/tds/562737.pdf 
Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


Brilliant Violet 421™ anti-mouse Ly6c - Extended Data Figure 5f. 

Manufacturer: BioLegend, #128031 

Monoclonal 

Clone: HK 1.4 

Lot: B266457 

Manufacturer’s validation information: https://www.biolegend.com/en-ie/products/brilliant-violet-421- 
anti-mouse-ly-6c-antibody-8586 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


Pacific Blue anti-mouse Ly-6A/E (Sca-1) - Extended Data Figure 1b. 

Manufacturer: BioLegend, #108120 

Monoclonal 

Clone: D7 

Lot: B181741 

Manufacturer’s validation information: https://www.biolegend.com/en-us/global-elements/pdf- 
opup/pacific-blue-anti-mouse-ly-6a-e-sca-1-antibody-3140 


Dilution used here: 1:100 
Visualized directly with conjugated fluorophore 


CD45-APC - Extended Data Figure 1b. 

Manufacturer: BD Biosciences, #559864 

Monoclonal 

Clone: 30-F11 

Lot: 6203964 

Manufacturer’s validation information: http://www.bdbiosciences.com/ds/pm/tds/559864.pdf 
Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 


Brilliant Violet 421™ anti-mouse CD31 [390] - Extended Data Figure 1b. 
Manufacturer: BioLegend, #102423 


Monoclonal 

Clone: 390 

Lot: B231773 

Manufacturer’s validation information: https://www.biolegend.com/en-us/products/brilliant-violet-421- 
anti-mouse-cd31-antibody-8599 

Dilution used here: 1:100 

Visualized directly with conjugated fluorophore 
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The metabolic pathways encoded by the human gut microbiome constantly interact 
with host gene products through numerous bioactive molecules’. Primary bile acids 
(BAs) are synthesized within hepatocytes and released into the duodenum to facilitate 
absorption of lipids or fat-soluble vitamins”. Some BAs (approximately 5%) escape 
into the colon, where gut commensal bacteria convert them into various intestinal 
BAs? that are important hormones that regulate host cholesterol metabolism and 
energy balance via several nuclear receptors and/or G-protein-coupled receptors**. 
These receptors have pivotal roles in shaping host innate immune responses’. 
However, the effect of this host-microorganism biliary network on the adaptive 
immune system remains poorly characterized. Here we report that both dietary and 
microbial factors influence the composition of the gut BA pool and modulate an 
important population of colonic FOXP3* regulatory T (T,,,) cells expressing the 
transcription factor RORy. Genetic abolition of BA metabolic pathways in individual 
gut symbionts significantly decreases this T,,, cell population. Restoration of the 
intestinal BA pool increases colonic RORy’T,., cell counts and ameliorates host 
susceptibility to inflammatory colitis via BA nuclear receptors. Thus, a pan-genomic 
biliary network interaction between hosts and their bacterial symbionts can control 
host immunological homeostasis via the resulting metabolites. 


FOXP3'T,. cells residing in the gut lamina propria are critical in regulat- 
ing intestinal inflammation®”’. A distinct T,., cell population expressing 
the transcription factor RORy is induced inthe colonic lamina propria 
by colonization with gut symbionts* ®. Unlike thymic T,,,.cells, colonic 
RORy’T,.g cells have a distinct phenotype (Helios and NRPI ), and their 
accumulation is influenced by enteric factors derived from diet or 
commensal colonization®”. We hypothesized that intestinal bacteria 
facilitate the induction of RORY’ T,.g cells by modifying metabolites 
resulting from the host’s diet, and analysed RORy*Helios’ cells in the 
colonic T,,, cell population from specific pathogen-free (SPF) mice 
fed different diets—that is, a nutrient-rich diet or a minimal diet (Sup- 
plementary Table 1). We compared RORYy’T,,, cells in these groups 
with those in germ-free (GF) mice fed a nutrient-rich diet (Fig. 1a). Both 
minimal-diet SPF mice and rich-diet GF mice had a lower number of 
RORy‘Helios cells in the colonic T,,., cell population than did rich- 
diet SPF mice (Fig. la and Extended Data Fig. 1a). The effect of diet on 
Tyeg cell homeostatic proportions was limited to the colon and was 
not observed in other regions of the intestinal tract (the duodenum, 
jejunum or ileum) or in other lymphoid organs (the thymus, spleen, 
lymph nodes or Peyer’s patches) (Extended Data Fig. 1b, c). When diets 
were switched from minimal to nutrient-rich, colonic RORY’ T,,, cell fre- 
quency was reversed (Extended Data Fig. 1d-f). This finding suggested 
that dietary components or their resulting products biotransformed 
by the host and its bacterial symbionts are probably responsible for 


the induction of colonic RORy’T,., cells. Consistent with our previous 
findings’, we determined that, in our mouse colony, short-chain fatty 
acids (SCFAs) alone appear to be irrelevant to the accumulation of 
colonic RORY’ T,. cells (Extended Data Fig. 1g-j). 

As intestinal BAs are important metabolites affected by the host’s diet 
and modified by gut bacteria, we checked the intestinal BA contents of 
these mice (Supplementary Table 2). The level of murine conjugated 
primary BAs (the taurine-conjugated species of cholic acid, cheno- 
deoxycholic acid, muricholic acids and ursodeoxycholic acid (TCA, 
TCDCA, TMCAs and TUDCA, respectively)") was greatly reduced in 
the faeces of minimal-diet SPF mice (Fig. 1b). Not surprisingly, the GF 
mice had accumulated conjugated primary BAs (Fig. 1b). However, 
along with the reduced RORy’T,,, cell population, the levels of faecal 
deconjugated primary BAs and secondary BAs were significantly lower 
in both minimal-diet SPF mice and rich-diet GF mice than in rich-diet 
SPF mice (Fig. Ic, d). These results suggested that host BAs generated 
in response to diet and biotransformed by bacteria may induce colonic 
RORY’ Tyeg Cells. 

To directly test whether BA metabolites regulate colonic RORY’ T,eg 
cells, we supplemented the drinking water of minimal-diet SPF 
mice with either individual or combinations of BAs (Fig. 1e). Neither 
individual primary nor secondary BAs rescued the counts of colonic 
RORY’ T, eg cells or total FOXP3* T,., cells. However, mixtures of 
certain murine primary BAs (cholic/ursodeoxycholic acids, 


‘Department of Immunology, Blavatnik Institute, Harvard Medical School, Boston, MA, USA. ?Center for Experimental Therapeutics and Reperfusion Injury, Department of Anesthesiology, 
Perioperative and Pain Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA. 9UCB Pharma, Slough, UK. “Present address: Department of Cell Biology and 
Cancer Science, Rappaport Faculty of Medicine, Technion Integrated Cancer Center, Technion-lsrael Institute of Technology, Haifa, Israel. These authors contributed equally: Xinyang Song, 


Ximei Sun. *e-mail: dennis_kasper@hms.harvard.edu 


410 | Nature | Vol577 | 16 January 2020 


a @ SPF-rich diet © SPF-minimal diet b @™ SPF-rich diet HB SPF-minimal diet 
Se Cee n=”) S$ sg0, 7° (n=5) (n= 5) + 
SPF- SPF- GF- ®@ GF-rich diet ro} 430 @@ GF-rich diet * 
rich diet minimal diet rich diet #604. “4 &  330-p* (n= 6) e f 
9 9 9 KiL e 35 8 
46.8% 21.7% 9.6% © iol em . 8 
OX GO 
=O ee tik ee 
Parone ° =? 
= > 20, = ie ok 
Of ° bd a2 
© 104 eatee 2 
Helios-FITC i = 
8 
Cy d 
5 1,410, Ml SPF-rich diet H@ SPF-minimal diet 2,350- Ml SPF-rich diet H@ SPF-minimal diet 
® 4110+ (n=5) (n=5) 9 » 1,850 (n =5) (n=5) % 
= — 8197 wm GF-rich diet 3 — 1,350} BM GF-rich diet a 
8 @ 5104 Oo 8 *850+4 
>% d19l (n=6) 4 8B S51 (n=6) B 
6 9 210+ a 28 1007 
E* t504 ° na ae : 
a2 904 Se i: f z> 754 ry 
Ba aol & o: Ae og : a 
Som 307 Sm 804 
3= 20) F.. B= 95] * 
S10 +bee ce leer |: len 2 Pe “pi tt ny ren = @ ba” 
3 : ‘a ‘a Na ‘a ‘a : ‘a ‘a 
a er o) @) @ o) (/ gES © SEER © SUE © SED © SEE © SEE © EE © 0 
os & . if a a of o i oo on Ss 
of J a’ ss Vv Ss 
e SPF- SPF- ° a° oY 
SPF- minimal diet minimal diet & 60, . 
minimal diet CA alone CDCA alone Es 504 Primary BAs pa Secondary BAs 
21.0% 20.1% 19.6% ig & 40 ? 3 i 3 ae 
=O e e e 
® i 304 @ ee ee «6 ¢€ 4 > 
2a wed? 3 =, e° oe $ Sate & e 
eS 20) tiv v 23 gv. vy Sette 
oO 104 
=” 9 
SPF- 60,5 © FT 2 O0FCCHKTKDOSODTHN OOH KR DOH OOOD 
SPF- SPF- minimal diet Seoff es ss sees eet ee eee eee egceeg 
minimal diet minimal diet Oxo-BAs/ 2 a e ie tek 
UDCA alone CA/CDCA/UDCA _LCA/DCA Da ae ot 8 fe 
a 304 e e e ee e 
21.2% | 38.0% 35.2% SO .  *#y 0 ¢ 
wr 204 ze 2 ) e $6 eon wvtce 
5) PTS LeS "Sey sy eters 
x e eee 
=e 0 T T T T T T T T T T T T T T T T T T T T r T T T T 
Richdiet + - - - - - = - = = = = = = = = = = = = = = = = = 
Minimaldiet - + + + + + + + + + + + + + + + + + + + + + + + + 
Helios-FITC PRE a SH FRR MP MP PS at ot 
elios Rees IS < es — oS SL FSi” Coe e — KS Ks es Ot 
SXP SHAS Sy oye OTM OO 
WA oT OER EP ne ve RCA: ci FS 
se FO ew Vs i e Vv &< eo? Ny 
os OP v Vv Saw 
% 
aS 
& v 


Fig.1|Gut BA metabolites are essential for colonic RORy 'T,,, cell 
maintenance. a, Beginning at 3 weeks of age, three groups of mice were fed 
special diets for 4 weeks. SPF mice were fed either a nutrient-rich or a minimal 
diet, and GF mice were fed the nutrient-rich diet. Representative plots and 
frequencies of RORy‘Helios inthe FOXP3*CD4'TCR§’T,., cell population are 
shown. b-d, Liquid chromatography-mass spectrometry (LC-MS) 
quantification of faecal conjugated primary BAs (b), deconjugated primary 
BAs(c) and secondary BAs (d) from groups of mice fed asina. The BAs 
determined were cholic acid (CA), chenodeoxycholic acid (CDCA), 
ursodeoxycholic acid (UDCA), a-muricholic acid (aMCA), BMCA, deoxycholic 
acid (DCA), lithocholic acid (LCA), 3-0xo-CA, 3-0xo-LCA, 7-0xo-CA, 7-Oxo- 
CDCA, 12-oxo-CA, 12-0xo-DCA, wMCA and taurine-conjugated species (TCA, 


chenodeoxycholic/ursodeoxycholic acids or cholic/chenodeoxycholic/ 
ursodeoxycholic acids) restored colonic frequencies of both RORY’ Tyeg 
cells and FOXP3’ T,., cells close to that in rich-diet mice (Fig. le and 
Extended Data Fig. 2a, b). Similarly, a mixture of eight representative 
secondary BAs from bacterial oxidation and dehydroxylation path- 
ways significantly increased the number of colonic RORY’ T,,, cells 
and FOXP3*T,.. cells in minimal-diet mice (Fig. le and Extended Data 
Fig. 2a, b). We tested several more combinations of secondary BAs and 


TCDCA, TUDCA, TaMCA and TBMCA). e, Three-week-old SPF mice were feda 
nutrient-rich diet, a minimal diet, or a minimal diet supplemented with one or 
more primary or secondary BAs in drinking water for 4 weeks. Representative 
plots and frequencies of colonic RORy Helios’ in the FOXP3°CD4°TCRB' T,,, cell 
population and of colonic FOXP3* in the CD4'TCRB* cell population are shown. 
The BAs used in the feed were CA, CDCA, UDCA, DCA, LCA, 3-0xo-CA, 3-0xo- 
LCA, 7-0xo-CA, 7-0xo-CDCA, 12-0xo-CA, 12-0xo-DCA and various indicated BA 
combinations. Data are representative of at least two independent 
experiments ina-d. Data are pooled from three independent experiments ine. 
nrepresents biologically independent animals. Data are mean +s.e.m. 
*P<0.05,**P<0.01,***P< 0.001, one-way analysis of variance (ANOVA) followed 
by the Bonferroni post hoc test. 


found that only a combination of lithocholic/3-oxo-lithocholic acids 
restored colonic RORY’ T,,, counts (Fig. le). No similar effect of these 
BAs was found on colonic type 17 T helper (T,,17) cells (Extended Data 
Fig. 2c), or on RORY’ T,,,cells or T,,17 cells from the spleen, mesenteric 
lymphnode or small intestine (Extended Data Fig. 2d, e). These results 
indicated that certain primary and secondary BA species within a BA 
pool preferentially regulate RORY’ T,,, cells and that modulation of this 
T,-g cell population by BAs is both cell-type and tissue-type specific. 
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Fig. 2| Gut bacteria control colonic RORy'T,., cells through their BA 
metabolic pathways. a, Schematic diagram of BA metabolic pathwaysin 

B. thetaiotaomicron and B. fragilis. b, Each of the four groups of GF mice was 
colonized with one of the following microorganisms for 2 weeks: (1) a wild-type 
(WT) strain of B. thetaiotaomicron; (2) a BSH-mutant strain (BSH KO, in which 
both the BT_1259 and BT_2086 genes are deleted); (3) a 7a-HSDH-mutant strain 
(7a-HSDH KO, in which the BT_1911 gene is deleted); or (4) atriple-mutant strain 
(TKO; in which all three genes are deleted). Representative plots and 


frequencies of RORy‘Helios in the colonic FOXP3*CD4‘TCRB'T,,, cell 


As colonic BAs are derived mainly from bacterial biomodifica- 
tions”, we sought to determine how BA-producing bacteria regulate 
colonic RORY’ T,.. cells. Although the composition of the gut micro- 
biota reorganized in minimal-diet mice (Extended Data Fig. 3a-c), 
BA-metabolizing bacteria—for example, Bacteroidetes and Firmi- 
cutes—remained the dominant phyla within the minimal-diet mouse 
intestine (Extended Data Fig. 3d). Neither dietary alteration nor BA 
supplementation affected the level of recognized secondary BA-gen- 
erating bacteria—that is, Clostridium cluster IV or XIVa—in the colons 
of these mice (Extended Data Fig. 3e), which suggests that the decrease 
in the number of RORY’ T,., cells in minimal-diet mice is not due to the 
loss of BA-producing bacteria. Minimal-diet GF mice had lownumbers 
of colonic RORY’ T,,, cells, as did rich-diet GF mice (Extended Data 
Fig. 3f). Transferring the gut microbiota from minimal-diet SPF mice 
into rich-diet GF mice—but not transfer into minimal-diet GF mice—fully 
restored the frequencies of colonic RORY’ T,.g cells (Extended Data 
Fig. 3f). These results suggested that switching diet alone is insufficient 
to tune RORy’T,,, cells, while, in response to dietary stimuli, bacteria 
within the gut modulate this T,,, cell population. 

Several phyla and genera of human gastrointestinal bacteria can 
induce RORy’T,-gcells, and these same microorganism can also salvage 
conjugated BAs escaping from active transport inthe ileum and convert 
them into various BA derivatives”*’*. We hypothesized that BA meta- 
bolic pathways in these bacteria are involved in the induction of colonic 
RORY’ T,,, cells. We chose two RORy’T,.g cell inducers—Bacteroides the- 
taiotaomicron and Bacteroides fragilis—to test this hypothesis, as they 
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population are shown. c, Each of the four groups of GF mice was colonized 

with one of the following microorganisms for 2 weeks: (1) a wild-type strain 

of B. fragilis; (2) a BSH-KO strain (in which the BF638R_3610 gene is deleted); (3) 
a7a-HSDH-KO strain (in which the BF638R_3349 gene is deleted); or (4) a 
double-mutant strain (DKO; in which both genes are deleted). Colonic T,.. 

cells were analysed as in b. Data are pooled from three independent 
experiments in band c.nrepresents biologically independent animals. Data 
are mean+s.e.m. (bandc).***P< 0.001, one-way ANOVA followed by the 
Bonferroni post hoc test. 


harbour simple BA metabolic pathways and are genetically tractable” 
(Fig. 2a). Using the suicide vector pNJR6 for Bacteroides'®, we knocked 
out the genes that encode proteins involved in both BA deconjugation 
(bile salt hydrolase (BSH)) and oxidation of hydroxy groups at the C-7 
position (7a-hydroxysteroid dehydrogenase (7a-HSDH)) (Extended 
Data Fig. 4a, b). A genetic deficiency of the BA metabolic genes in these 
two species did not affect the ability of bacteria to colonize GF mouse 
colons (Extended Data Fig. 4c, d). Knockout of genes encoding BSH 
in these species altered the BA pool in monocolonized GF mice and 
impaired in vivo deconjugation of BAs (Extended Data Fig. 4e, f). These 
results were consistent with a recent report on the functions of BSHin 
Bacteroides’. Notably, we detected no BA level changes after deletion of 
the genes encoding 7a-HSDH, indicating that in these bacterial species, 
7a-oxidation may not have a major role in BA metabolism. Elimina- 
tion of BSH or the entire BA metabolic pathway in Bacteroides—atriple 
knockout (TKO) in B. thetaiotaomicron or a double knockout (DKO) in 
B. fragilis-dampened bacterial ability to induce colonic RORy’T,,.gcells 
(Fig. 2b, cand Extended Data Fig. Sa, c). By contrast, 7x-HSDH mutants 
elicited counts of colonic RORy’T,,., cells comparable to those induced 
by wild-type Bacteroides (Fig. 2b, c and Extended Data Fig. Sa, c). As 
Bacteroides do not harbour typical biotransformation pathways for 
secondary BA generation’, the decrease in the number of RORY’ T,.,.cells 
in BSH-mutant-associated GF mice indicated a direct role for primary 
BAs in regulating this T,., cell population. Deletion of BA biotransforma- 
tion pathways in Bacteroides did not alter the ability of these species 
to induce colonic total FOXP3'T,.g cells, colonic T,,17 cells (Extended 
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Fig. 3| BA metabolites modulate colonic RORy'T,,, cells via BARs. 

a, Quantitative mRNA expression of BARs in the colon of SPF mice. 

b, Frequencies of colonic RORY Helios in the FOXP3°CD4°TCRB’T,,,cell 
population from mice deficient in G-protein-coupled receptors (Gpbarl”, 
Chrm2‘-, Chrm3~ and S1pr2“) and their littermate controls. c, Representative 
plots and frequencies of colonic RORy“Helios in the FOXP3*°CD4'TCRP' T,.g cell 
population from mice deficient in nuclear receptors (Nrli2/-Nrli3-, NrIh3~-, 
Vdr'-, Nrih4 and Vdr“-Nrih4~), their littermate controls and minimal-diet 
Vdr'/*Nrih4*" mice. d, Three-week-old Vdr*Nrih4", Vdr-, Nrih4 and 
Vdr'Nrih4~ mice were fed a minimal diet or a minimal diet supplemented with 
primary BAs (CA/CDCA/UDCA, 2 mM of each) or secondary BAs (DCA/LCA/3- 
o0xo-CA/3-0x0-LCA/7-0xo-CA/7-0xo-CDCA/12-0x0-CA/12-0xo-DCA, 1mM of 
each) in their drinking water for 4 weeks. Frequencies of colonic RORy*Helios” 
inthe FOXP3*°CD4°TCR®' T,,, cell population are shown. e, Normalized 


Data Fig. 5b, d-h), or T,.g cells and T,,17 cells in the spleen, mesenteric 
lymph node and small intestine (Extended Data Fig. 5i-I). 

Host BAs are metabolically modified by microorganisms and used 
as signalling molecules, serving as ligands that activate BA receptors 
(BARs)*. Thus, we explored whether BARs modulate gut RORY’ T,., cell 
homeostasis. By comparing mouse colonic tissue expression of various 
BARs, we found that nuclear receptors, especially vitamin D receptor 
(VDR), were more abundant (Fig. 3a and Extended Data Fig. 6a). Next, 
we compared the numbers of colonic RORY’ T,.g cells in mice genetically 
deficient in each of several BARs. G-protein-coupled receptor defi- 
ciency did not affect the number of colonic RORY’ T,,, cells (Fig. 3b). The 
nuclear receptors NR1I2/3 (also known as PXR/CAR) and NR1H3 (also 
knownas LXRa) werealso non-contributory (Fig. 3c and Extended Data 
Fig. 6b). However, deficiency in either of the two BA-sensing nuclear 
receptors—VDR and NR1H4 (also known as FXR)””°—compromised the 


expression values of Vdrin colonic T.o, cells, FOXP3°T,,., cells, dendritic cells 
(DCs) and epithelial cells (Epith). f, Normalized expression values of Vdrin 
splenic Tyeg cells (Sp Treg), Colonic T, eg cells (Co T,-g), colonic RORY Treg cells 
(Co RORY T,,.,) and colonic RORy’T,.. cells (Co RORY’ T,..). 8, Frequencies of 
colonic RORy‘Helios” in the FOXP3*CD4°TCRB'T,,, cell population from 

Var fx Fox 3th ere, Var FxoxCa qc and VdrF" Vill mice and from 
Vdrf'/fx and Foxp3"“ mice. h, Volcano plots comparing transcriptomes 
of colonic T,,.,cells from Vdr*Foxp3"*” or Vd Foxp3™*” mice (n=3). 
Colonic T,,, cell signature genes are highlighted in red (upregulated) or blue 
(downregulated). Data are pooled from two or three independent experiments. 
nrepresents biologically independent animals. Dataare mean+s.e.m. 
**P<0.01,***P< 0.001, one-way ANOVA followed by the Bonferroni post hoc 
test (c,d and g) or x’ test (h). 


number of colonic RORy’T,,, cells (Fig. 3c and Extended Data Fig. 6b). 
The colonic RORY’ T,,g cell phenotype of mice with loss of both of these 
genes resembled that of mice with loss of only Vdr or in minimal-diet 
littermate controls (Fig. 3c and Extended Data Fig. 6b). This result indi- 
cated a prominent role for VDR signalling in modulating this colonic 
Tyeg cell population. That all BAR-deficient mice had normal colonic 
counts of total FOXP3°T,.. cells and T,,17 cells (Extended Data Fig. 6c-g) 
as well as RORy’T,,, cells in the spleen, mesenteric lymph node and 
small intestine (Extended Data Fig. 6h-n) indicated that BAR signalling 
control of RORY’ T,.g cells is tissue dependent. 

To determine whether intestinal BAs modulate colonic RORY’ Tyg 
cells via VDR or NR1H4, we treated nuclear receptor-deficient mice 
witha mixture of three murine primary BAs (cholic/chenodeoxycholic/ 
ursodeoxycholic acids) or a mixture of eight murine secondary BAs 
(deoxycholic acid/lithocholic acid/oxidized BAs). Both mixtures of 
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Fig. 4 | BAs ameliorate gut inflammation. a, Frequencies of RORy‘Helios in 
the colonic FOXP3*CD4"TCRB'T,,., cell population on day 2 of DSS-induced 
colitis in mice fed a nutrient-rich diet, a minimal diet, or a minimal diet 
supplemented with mixtures of primary or secondary BAs in drinking water. 
The primary BAs were CA, CDCA and UDCA (2 mM of each). The secondary BAs 
were DCA, LCA, 3-0xo-CA, 3-0xo-LCA, 7-0xo-CA, 7-0xo-CDCA, 12-0xo-CA and 
12-0xo-DCA (1mM of each). b, Daily weight loss of mice described ina during 
the course of DSS-induced colitis. c, d, Clinical scores (c) and haematoxylin and 


BAs were sufficient to rescue colonic frequencies of RORY’ T,., cells 
in minimal-diet NR1H4-deficient mice and their littermate controls 
(Fig. 3d). However, the failure of BA supplementation to restore colonic 
RORy’T,.g cell counts in both minimal-diet VDR-deficient mice and 
minimal-diet mice deficient in both VDR and NR1H4 (Fig. 3d) implied 
amajor role for the microbial BA-VDR axis in the regulation of colonic 
RORy’T,.¢ cell populations. 

As1,25-dihydroxyvitamin D, can also activate VDR”, we compared its 
levels in sera and colonic tissues from rich-diet and minimal-diet SPF 
mice and rich-diet GF mice. We found comparable levels among these 
three groups of mice (Extended Data Fig. 7a, b). We also found that, as 
previously reported’, dietary vitamin A deficiency led to a decrease 
inthe number of colonic RORY’ T,,g cells, while the absence of dietary 
vitamin D, did not havea similar effect (Extended Data Fig. 7c, d). These 
data indicated that the BA-VDR axis, not the vitamin D,-VDR axis, is 
crucial in modulating RORy’T,., cells in the gut. 

Next, we investigated which cell type responds to BA by affecting the 
upregulation of RORy’T,., cells. RNA sequencing (RNA-seq) performed 
oncolonic-sorted cell fractions revealed high Vdr expression in epithe- 
lial and dendritic cells but also in FOXP3’T,,, cells—at a higher level than 
inT conventional (T,,,y) cells (Fig. 3e). Indeed, Vdr expression is higher 
incolonicT,,,cells than in splenic T,,, cells, especially in RORy’T,., cells 
(Fig. 3f). This result is consistent with our earlier single-cell analysis 
highlighting VDR as atranscriptional regulator in colonic T,,, cells but 
not in other tissue T,, cells”. To directly assess relevance, we crossed 
Vdr fx conditional knockout mice with different Cre drivers to excise 
Vdr in distinct cell types. Colonic RORY’ T,,, cell counts were markedly 
reduced in Vdr?"xFoxp3"? mice, while loss of VDR in dendritic or 
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eosin-stained histological sections (d) for representative colons on day 10 of 
colitis (see b). e, Daily weight loss of Var" and Vari" Foxp3"? mice 
during the course of DSS-induced colitis. f, Clinical scores for representative 
colons on day 10 of colitis (see e). Data are representative of two or three 
independent experiments. n represents biologically independent animals. 
Data are mean+s.e.m. (a-c, eand f). *P< 0.05, **P< 0.01, ***P< 0.001, one-way 
ANOVA (aand c) or two-way ANOVA (bande) followed by the Bonferroni 

post hoc test, or two-tailed Student’s t-test (f). 


epithelial cells (Cd1Ic“’ and Vill; Cd1Ic is also known as /tgax) had no 
effect (Fig. 3g), demonstrating that VDR modulates colonic RORY’ T,.g 
cellsin anintrinsic manner. RNA-seq profiling of colonic T,,.,cells from 
VDR-deficient mice or control littermates showed a marked reduction 
in the transcriptional signature of colonic T,,, cells* in the absence of 
VDR (Fig. 3h); the RORy’T,,, cell-specific signature was also downregu- 
lated, as expected (Extended Data Fig. 8). These data suggest that VDR 
is important in determining the RORy-dependent program in colonic 
Treg cells. 

RORy'T,,, cells have been reported to maintain colonic homeostasis 
and minimize colitis severity" ”’. We investigated whether intestinal 
BAs modulate colonic inflammatory responses in a dextran sodium 
sulfate (DSS)-induced murine model of colitis. We saw no sign of inflam- 
mation in the colons of unchallenged mice fed a nutrient-rich diet, a 
minimal diet, or a minimal diet supplemented with primary or second- 
ary BA mixtures in drinking water (Extended Data Fig. 9a—c). However, 
at the onset of colitis, challenged minimal-diet mice had a reduced 
proportion of colonic RORY’ T,,, cells (Fig. 4a)—an indication that they 
might be predisposed to severe colitis. Indeed, as DSS-induced colitis 
progressed, minimal-diet mice lost more weight and experienced more- 
severe colitis than rich-diet mice (Fig. 4b-d). Notably, both primary 
and secondary BA supplementation increased RORY’ T,,, cell counts in 
minimal-diet mice (Fig. 4a) and alleviated their colitis symptoms and 
signs (Fig. 4b-d). However, after colitis onset, BA supplementation 
barely alleviated colitis in minimal-diet mice (Extended Data Fig. 9d, e), 
suggesting that maintenance of an RORY’ T,,, cell pool by BAs dur- 
ing homeostasis is crucial to host resistance to DSS colitis. We next 
explored whether the BA-VDR axis is involved in regulating colitis in 


mice. As in minimal-diet mice, deficiency of VDR in mice worsened 
the DSS-induced colitis phenotype (Extended Data Fig. 10a—c). The 
protective role of VDR signalling was also consistently observed in 
another mouse colitis model: naive CD4* T cell adaptive transfer into 
RAGI1-deficient mice (Extended Data Fig. 10d-g). Importantly, severe 
DSS-induced colitis developed in Vdr/°"""“Foxp3"""“" mice (Fig. 4e, f), 
implying that the BA-VDR axis has an intrinsic role in T,,, cell control 
of colonic inflammation. 

As important molecular mediators, intestinal BAs are critical in 
maintaining a healthy colonic RORy’ T,,, cell pool through BARs. Gut 
bacteria differ in the types and quantities of BA derivatives they can 
generate. For instance, across different phyla, many gut bacterial spe- 
cies harbour genes encoding BSH that are involved in the primary BA 
deconjugation process. As their regulation and substrate specificity 
may vary, both primary and secondary BA metabolic profiles of these 
bacteria are likely to be affected””». In addition, the microbial diversity 
of secondary BA metabolism adds another layer of complexity to BA 
derivative production in individual species that may ultimately affect 
their capacity for RORy’T,., cell induction. 

In view of the complexity of BA derivatives, our study suggests 
that dominant intestinal primary BA species—for example, cholic/ 
chenodeoxycholic/ursodeoxycholic acids—along with certain potent 
secondary BA species—for example, lithocholic/3-oxo-lithocholic 
acids—modulate RORY’ T,,, cells through the BAR VDR. Dysregulation 
of intestinal BAs has been proposed as a mediator of the pathogenesis 
of human inflammatory bowel diseases and colorectal cancers’™. The 
essential role of VDR in modulating peripheral RORy’T,,, cells and 
colitis susceptibility raises the interesting possibility that human VDR 
genetic variants associated with inflammatory bowel diseases* might 
affect disease susceptibility through improper control of the intes- 
tinal T,., cell pool. Mechanistically, it is intriguing to speculate that 
the nuclear receptor VDR—a colonic T,., cell-preferring transcription 
factor”'—may modulate colonic T,,, cell homeostasis by coordinating 
BA signals with transcription factor activity. An understanding of the 
molecular mechanisms underlying the regulation of colonic T,,, cells 
by this biliary network between hosts and their associated microorgan- 
isms will be valuable in improving therapy for human gastrointestinal 
inflammatory disorders. 
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Methods 


Mice and dietary treatments 

C57BL/6] wild-type mice were obtained from Jackson, as were Vdr~“, 
Nrih3, Nrih4”, Chrm2", Chrm3", Rag, Foxp3"", Cdl1c’, Vill” 
and Foxp3”*"? mice. Nr1i2’ Nrli3“ mice were obtained from Taconic. 
Gpbar1“ mice were purchased from KOMP Repository. SIpr2" mice 
were provided by T. Hla (Boston Children’s Hospital). Var"°* mice”® 
were provided by D. Gardner (University of California, San Francisco) 
and then crossed with Foxp3""", Cdl1c™ or Vill“ mice to generate the 
corresponding cell-type-specific knockout mice. Vdr/"Nr1h4" mice 
were obtained by crossing Vdr'/" with Nr1h4*" mice. Vdr'/*Foxp3™*” 
or Vdr’ Foxp3™*” reporter mice were generated by crossing Vdr'/” 
with Foxp3™*” reporter mice. Sterilized vitamin A-deficient, vitamin 
D,-deficient and control diets ona nutrient-rich diet background were 
purchased from TestDiet. For dietary treatment experiments with BA 
supplementation, 3-week-old C57BL/6) wild-type mice were fed either 
asterilized nutrient-rich diet (LabDiet 5K67) or a minimal diet (TestDiet 
AIN-76A) for 4 weeks. Some groups of the mice fed a minimal diet were 
also treated with various BAs (sodium salt form) in drinking water for 
4 weeks as described below. The primary BAs tested were cholic acid, 
chenodeoxycholic acid and ursodeoxycholic acid (2mM or 6mM ofa 
single BA); a mixture of cholic acid and chenodeoxycholic acid (3 mM of 
each, 6mM in total); a mixture of cholic acid and ursodeoxycholic acid 
(3mM ofeach, 6 mM in total); a mixture of chenodeoxycholic acid and 
ursodeoxycholic acid (3 mM of each, 6 mM in total); and a mixture of 
cholic acid, chenodeoxycholic acid and ursodeoxycholic acid (2 mM of 
each, 6mM in total). The secondary BAs tested were deoxycholic acid, 
lithocholic acid, 3-oxo-cholic acid, 3-oxo-lithocholic acid, 7-oxo-cholic 
acid, 7-oxo-chenodeoxycholic acid, 12-oxo-cholic acid and 12-oxo- 
deoxycholic acid (2 mM of a single BA); a mixture of deoxycholic acid 
and lithocholic acid (2 mM of each, 4 mM in total); a mixture of litho- 
cholic acid and 3-oxo-lithocholic acid (2 mM of each, 4 mM in total); a 
mixture of deoxycholic acid, 3-oxo-cholic acid, 7-oxo-cholic acid and 
12-oxo-cholic acid (2 mM of each, 8 mM in total); a mixture containing 
the oxo-BAs (1mM of each, 6 mM in total); and a mixture containing the 
oxo-BAs, deoxycholic acid and lithocholic acid (1mM of each, 8 mMin 
total). Three-week-old mice fed a minimal diet were also treated with 
SCFAs (sodium salt form) in drinking water. The SCFAs tested were 
acetate and propionate (36 mM or 150 mM of either SCFA); butyrate 
(36 mM or 100 mM); and a combination of acetate, propionate and 
butyrate (36 mM of each, 108 mM in total). Fresh drinking water was 
supplied each week. Pregnant C57BL/6J wild-type mice were also fed 
a sterilized nutrient-rich diet or a minimal diet during gestation and 
nursing, and their offspring were exposed to the same diet until wean- 
ing at approximately 3 weeks of age. The offspring were then fed the 
same diet as their dams or switched to the opposite diet for another 
4 weeks before phenotype analysis. Occasional growth restriction in 
mice fed a minimal diet necessitated their exclusion from all experi- 
ments. All mice were housed under the same conditions in SPF facilities 
at Harvard Medical School (HMS), and littermates from each mouse 
line were bred as strict controls. GF CS57BL/6 mice were obtained from 
the National Gnotobiotic Rodent Resource Center at the University of 
North Carolina at Chapel Hill and were maintained in GF isolators at 
HMS. Animal protocol ISO0000187-3 and COMS protocol 07-267 were 
approved by the HMS Institutional Animal Care and Use Committee 
and the Committee on Microbiological Safety, respectively. All animal 
studies were performed in compliance with the guidelines of ARRIVE. 


Anaerobic bacterial culture 

B. thetaiotaomicron VP1-5482 and B. fragilis 638R were grown either 
on basal medium (protease peptone, 20 g I; yeast extract, 5 gI7; and 
NaCl, 5 gl) supplemented with 0.5% glucose, 0.5% K,HPO,, 0.05% 
L-cysteine, 5 mg I“ haemin and 2.5 mg!‘ vitamin K,, or on BBL Brucella 
agar with 5% sheep blood, haemin and vitamin K,. The bacteria were 


then cultured under strictly anaerobic conditions (80% N,, 10% H, and 
10% CO,) at 37 °C in an anaerobic chamber. 


Genetical manipulation of bacteria 

Deletion mutants of B. thetaiotaomicron VPI-5482 were created by 
removal of the genes encoding BSHs (BT_1259 and BT_2086), the gene 
encoding 7a-HSDH (BT_1911) or all three genes. Deletion mutants of 
B. fragilis 638R were created by removal of the gene encoding BSH 
(BF638R_3610), the gene encoding 7a-HSDH (BF638R_3349) or 
both genes. DNA segments (1 kb) upstream and downstream of the 
region to be deleted were PCR amplified with the following primers: 
BT_1259 up forward: 5’-TTTGTCGACTTATATTT TCT TCCAAAACT-3’; 
BT_1259 up reverse: 5’-CAAGCTGCTATCAATAATTTCGATTTTT 
AGTTATA-3’; BT_1259 down forward: 5’-CTAAAAATCGAAATTA 
TTGATAGCAGCTTGCTGCA-3’; BT_1259 down reverse: 5’-TTTGG 
ATCCGGGAGGATTCCACATAATAT-3’; BT_2086 up forward: 5’-TTTGTCG 
ACCATCCAAACCCAGTGTGAAC-3’; BT_2086 up reverse: 5’-ATAACT 
AACTATCGAAT TACTTCCAAATTAAATAG-3’; BT_2086 down forward: 
5’-TAATTTGGAAGTAAT TCGATAGT TAGT TATGTGGT-3’; BT_2086 down 
reverse: 5’-TT TGGATCCAAGAGCATAAAGAGCTGTTG-3’; BT_1911 up 
forward: 5’-TTTGTCGACTAGGAAAAGAAAAAGTGATC-3’; BT_1911 up 
reverse: 5’-CTGICCGGGGTATATATATGT TGAGAATT TGATGA-3’; BT_1911 
down forward: 5’-AAAT TCTCAACATATATATACCCCGGACAGTACAT-3’; 
BT_1911 down reverse: 5’-TT TGGATCCAAT T TGATATAAGCGTACGA-3’; 
BF638R_3610 up forward: 5’-TTTGTCGACTATAGCTGGATGGCTG 
TTGC-3’; BF638R_3610 up reverse: 5’-CCTCGGTAGACATT TACTCTTT 
ATATTAAAATGGT-3’; BF638R_3610 down forward: 5’-TTTAATATAAAG 
AGTAAATGTCTACCGAGGCAGAT-3’; BF638R_3610 down reverse: 
5’-TTTGGATCCAGAAGAAGAGAT TGGTTCAC-3’; BF638R_3349 up for- 
ward: 5’-TTTGTCGACATCCCCGCCTGAGCAAGAAG-3’; BF638R_3349 
up reverse: 5’-T TGATAAGAT TCT T TAATAGGATGGT TT TGAGGAT-3’; 
BF638R_3349 down forward: 5’-CAAAACCATCCTAT TAAAGAATCTT 
ATCAAGTTAC-3’; BF638R_3349 down reverse: 5’-TTTGGATCCCC 
TGAGGGGTGGGAGAAACT-3’. PCR products from upstream and 
downstream regions were further fused together by fusion PCR to 
generate 2-kb PCR products, which were digested with BamHI and 
Sall. The digested products were cloned into the appropriate site of 
the Bacteroides suicide vector pNJR6. The resulting plasmids were 
conjugally transferred into Bacteroides strains by the helper plasmid 
R751, and Bacteroides strains integrated with the plasmids were selected 
by erythromycin resistance. The integrated Bacteroides strains were 
passaged for ten generations without erythromycin selection, and 
cross-out mutants were determined by PCR with primers targeting the 
two flanking regions of the indicated genes. PCR of an intact gene plus 
its flanking regions generated a product of approximately 1,150-1,500 
bp, while successful deletion of the gene of interest resulted ina PCR 
amplicon of only 350-450 bp of its two flanking regions. The DKO 
mutant of B. fragilis and the TKO mutant of B. thetaiotaomicron were 
created by subsequent deletion of the indicated genes. 


Generation of monocolonized mice 

GF C57BL/6J mice were orally inoculated by gavage with a broth-grown 
single bacterial strain at 4 weeks of age. Each group of mice was then 
maintained in a gnotobiotic isolator under sterile conditions for 2 
weeks. Faecal material was collected and plated 2 weeks after bacterial 
inoculation to determine colonization levels (colony-forming unit per 
gram of faeces) and to ensure colonization by a single bacterial strain. 


BA extraction and quantification 

Faecal contents from the distal colons of mice were collected and 
weighed before BA extraction”’. In brief, one weighed faecal pellet was 
resuspended with 500 ul sterile ddH,Oina1.5-ml screw-cap conical tube 
and sonicated for 10 min. The homogenates were centrifuged at 300g 
for 5 min, and the supernatants were transferred to a new tube. The 
remaining material was resuspended in 250 pl LC-MS-grade methanol 


containing internal standards and subjected to sonication for another 
10 min. The homogenates were also centrifuged at 300g for 5 min and 
the supernatants were collected and combined, after which another 
500-pl volume of sterile ddH,O was added to make a final 1.25-ml solu- 
tion. The solution was acidified with acetic acid (Sigma) and then loaded 
ontoamethanol-activated C18 column. The column was subsequently 
washed with 1 ml of 85:15 (vol/vol) H,O/methanol solution, andthe BAs 
were eluted with 2 ml of 25:75 (vol/vol) H,O/methanol solution. The 
eluted solutions were dried under nitrogen and resuspended in 200 
pl of 50:50 (vol/vol) H,O/acetonitrile solution. The samples were kept 
at -20 °C until analysed. Nineteen synthetic standards (taurocholic 
acid, taurochenodeoxycholic acid, tauroursodeoxycholic acid, cholic 
acid, chenodeoxycholic acid, ursodeoxycholic acid, lithocholic acid 
and deoxycholic acid from Sigma; tauro-a-muricholic acid, tauro-B- 
muricholic acid, B-muricholic acid and w-muricholic acid from Santa 
Cruz; and a-muricholic acid, 3-oxo-cholic acid, 3-oxo-lithocholic acid, 
7-oxo-cholic acid, 7-oxo-chenodeoxycholic acid, 12-oxo-cholic acid and 
12-oxo-deoxycholic acid from Steraloids) were run, and a calibration 
curve was generated for quantification. All standards were resuspended 
in LC-MS-grade acetonitrile and prepared in-phase for analysis. All sam- 
ples and standards were analysed by ultra-performance liquid chroma- 
tography coupled to electrospray ionization tandem mass spectrometry 
(UPLC-ESI-MS/MS; Thermo Scientific Orbitrap Q Exactive with Thermo 
Vanquish UPLC) with a Phenomenex Kinetex C8 (100 mm x 3 mm x 1.7 
pum) column; a 20-95% gradient of H,O-acetonitrile was used as eluent, 
with 0.1% formic acid as an additive. Mass spectral data were acquired 
by acycle of MS1 scan (range 300-650), followed by data-dependent 
MS/MS scanning (ddMS2) of the top 5 ions in the pre-generated inclu- 
sion list. D,-glycocholic acid and D,-deoxycholic acid served as internal 
standards for conjugated and unconjugated BAs, respectively. 


SCFA extraction and quantification 

Faecal contents from the distal colons of mice were collected and 
weighed before SCFA extraction. Faecal contents were resuspended 
in 75% acetonitrile containing three deuterated internal standards 
(D;-acetate, D;-propionate and D,-butyrate). Samples were sonicated 
for 10 min, vortexed and centrifuged at 8,000g for 5 min. Supernatant 
was collected and treated with activated charcoal to remove non-polar 
lipids by re-centrifuging. Supernatant was then collected, dried under 
nitrogen and resuspended in 95% acetonitrile solution. The samples 
were kept at —20 °C until analysed. All standards were resuspended 
in UPLC-grade methanol. For hydrophilic interaction liquid chroma- 
tography (HILIC)—-ESI-MS/MS analysis, a Waters BEH amide HILIC col- 
umn (2.1 mm x 100 mm x 2.5 pm) was used with a linear gradient of 
acetonitrile:H,O = 95:5 to 60:40 (vol/vol) with 2mM ammonium formate 
at pH 9.0. Deprotonated anion ([M—H]) of each SCFA (acetate, propi- 
onate and butyrate) was quantitated in negative-ion mode. 


MS data acquisition and processing 

Each individual species has been matched with its (1) accurate MS1 anion 
([M-H] or [M+ HCOO’]), (2) isotope ratio, and (3) LC retention time 
of authentic standard, for suitable identification and quantification 
of level 1 metabolite”®. Thermo Xcalibur Suite version 3.0 was used 
for peak identification and area integration as well as generation of 
a calibration curve for all synthetic standards. Raw data (integrated 
ion counts) were converted to absolute amounts by the calibration 
curve of individual species. Recovery of internal standards of individual 
samples was also calculated in same procedure. The calculated value 
was normalized by sample weight and the proportion of the injected 
sample for analysis to generate the desired final concentration (ug per 
gor pmol per g sample). 


Isolation of mouse lymphocytes and flow cytometry 
For isolation of lamina proprialymphocytes, colonic and small-intesti- 
nal tissues were dissected and fatty portions discarded. Peyer’s patches 


were also removed from the small intestines for isolation of lympho- 
cytes. The excised intestinal tissues were washed in cold PBS buffer, 
and epithelia were removed by 500-r.p.m. stirring at 37 °C in RPMI 
medium (Gibco) containing 1 mM EDTA (Ambion), 1mM dithiothreitol 
(Sigma) and 2% (vol/vol) FBS (GemBio). After 15 min of incubation, the 
epithelium-containing supernatants were discarded, and the remaining 
intestinal tissues were washed in RPMI medium with 5% (vol/vol) FBS, 
further minced into small pieces, and digested by 500-r.p.m. stirring 
at 37 °C in RPMI medium containing collagenase type II (1.5 mg mI}, 
Invitrogen), Dispase II (0.5 mg mI, Invitrogen) and 1.2% (vol/vol) FBS 
for 40 min. The digested tissues were filtered, and the solutions were 
centrifuged at 500g for 10 min to collect lamina propria cells. The 
pellets were resuspended, and the lamina propria lymphocytes were 
isolated by GE Healthcare Percoll (40%/80%) gradient centrifugation. 
For isolation of Peyer’s patch lymphocytes, the excised Peyer’s patches 
were digested in the same medium by 500-r.p.m. stirring at 37 °C for 
10 min, the digested Peyer’s patches were filtered, the solutions were 
centrifuged at 500g for 10 min and lymphocytes were collected. Lymph 
nodes, spleens and thymuses were mechanically disrupted. Single-cell 
suspensions were subjected to flow cytometric analysis by staining with 
antibodies against CD45 (30-F11, BioLegend), CD4 (GK1.5, BioLegend), 
TCR6 (H57-597, BioLegend), Helios (22F6, BioLegend), RORy (AFKJS-9, 
eBioscience) and FOXP3 (FJK-16s, eBioscience). For intracellular stain- 
ing of transcription factors, cells were blocked with an antibody against 
CD16/32 (2.4G2, BD Pharmingen) and stained for surface and viability 
markers. Fixation of cells in eBioscience Fix/Perm buffer for 50 min at 
room temperature was followed by permeabilization in eBioscience 
permeabilization buffer for 50 min in the presence of antibodies at 
room temperature. Cells were acquired with a Miltenyi MACSQuant 
Analyzer, and analysis was performed with FlowJo software. 


Murine colitis models and histology 

Three-week-old C57BL/6J wild-type mice were fed a sterilized nutrient- 
rich diet or a minimal diet for 4 weeks. Some groups of the mice fed a 
minimal diet were also pre-treated with a bile-salt mixture in drinking 
water for 4 weeks. The mice were next treated with 2.5% DSS (MP Bio- 
medicals) in drinking water for 5 days and then switched back to regular 
water or bile salt-containing water for another 5 days”. Vdr*"*, Vdr"", 
Var fx and Vdr2"xFoxp3"’ mice were treated with 2.5% DSS by 
the same protocol. The mice were weighed throughout the course of 
experimental colitis and were killed at specific time points, after which 
colonic tissue was obtained for histopathological and FACS analyses. 
For the T cell-adoptive transfer model of colitis”’, the indicated splenic 
naive T cells (TCRB*CD4*CD25 CD45RB") were sorted from 8-week- 
old Vdr** and Vdr’ mice by flow cytometry (Astrios, BD Biosciences). 
Cells (5 x 10° in 200 ul sterile PBS) were then intraperitoneally injected 
into RagI“ recipient mice. The mice were weighed throughout the 
T cell colitis model to assess their weight loss. Colonic tissues from 
both colitis models were dissected and immediately fixed with Bouin’s 
fixative solution (RICCA) for histological analyses. Paraffin-embedded 
sections of colonic tissue were stained with haematoxylin and eosin, 
and clinical scores were determined by light microscopy”. 


RNA isolation and quantitative PCR 

Total RNA was extracted from mouse colonic tissues with TRIzol rea- 
gent (Invitrogen) according to the manufacturer’s instructions. RNA 
samples were reverse-transcribed into cDNA witha TaKaRa PrimeScript 
RT Reagent Kit. The cDNA samples were amplified by quantitative PCR 
with a KAPA SYBR FAST qPCR kit on an Eppendorf Realplex? Master- 
cycler. The primers for quantitative PCR were as follow: Vdr forward: 
5’-CACCTGGCTGATCTTGTCAGT-3’; Vdr reverse: 5’-CTGGTCATCAGAGG 
TGAGGTC-3’; Nrih3 forward: 5’-TGTGCGCTCAGCTCTIGT-3’; Nrih3 
reverse: 5’-TGGAGCCCTGGACATTACC-3’; Nrih4 forward: 5’-GAAAATCC 
AATTCAGATTAGTCTTCAC-3’; Nrih4 reverse: 5’-CCGCGTGTTCTGTT 
AGCAT-3’; Nrii2 forward: 5’-TCTCAGGTGT TAGGTGGGAGA-3’; Nri2 
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reverse: 5’-GCACGGGGGTACAACTGT TA-3’; Nr1i3 forward: 5’-AAAT- 
GTTGGCATGAGGAAAGA-3’; Nr1i3 reverse: 5’-CTGAT TCAGT TGCAAA- 
GATGCT-3’; Gpbar1 forward: 5’-ATTCCCATGGGGGTTCTG-3’; Gpbar1 
reverse: 5’-GAGCAGGT TGGCGATGAC-3’; Chrm2 forward: 5’-AAAGGCTC- 
CTCGCTCCAG-3’; Chrm2 reverse: 5’-AGTCAAGTGGCCAAAGAAACA-3’; 
Chrm3 forward: 5’-ACTGGACAGTCCGGGAGATT-3’; Chrm3 reverse: 
5’-TGCCATTGCTGGTCATATCT-3’; S1pr2 forward: 5’-CCCAACTCCGG- 
GACATAGA-3’; SIpr2 reverse: 5’-ACAGCCAGTGGT TGGTTTTG-3’; 
Rpli3a forward: 5’-GGGCAGGT TCTGGTAT TGGAT-3’; Rpl13a reverse: 
5’-GGCTCGGAAATGGTAGGGG-3’. Expression of the indicated genes 
was normalized to the expression of housekeeping gene Rpl13a. 


16S rRNA profiling of the gut microbiota 

Faecal DNA was extracted from faecal pellets of the indicated mice 
with a QIAGEN QIAamp Fast DNA Stool Mini Kit (ref. no. 51604) accord- 
ing to the manufacturer’s instructions. The DNA samples were stored 
at -80 °C before processing. Purified DNA samples were quantified 
with a Qubit dsDNA HS Assay Kit (Invitrogen, Q32854) and normal- 
ized to 6ng pl", with subsequent amplification with barcoded primer 
pairs: 515f PCR forward primer: 5’-AATGATACGGCGACCACCGAGAT 
CTACACGCT TATGGTAATT GT GTGCCAGCMGCCGCGGTAA-3’; 
806r PCR reverse primer: 5’-CAAGCAGAAGACGGCATACGAGAT 
XXXXXXXXXXXXAGTCAGTCAG CC GGACTACHVGGGTWTCTAAT-3’ 
(with XXXXXXXXXXXX representing the barcodes). PCR products 
were quantified with the above Qubit kit and then combined into 
a pool library with equal mass to each sample. The pooled library 
was purified by AMpure beads (Beckman Coulter, A63880) and then 
subjected to agarose gel electrophoresis. A DNA band of approximately 
390 bp was dissected and purified with a QlAquick Gel Extraction 
Kit (Qiagen, 28704). The concentration of the pooled library was meas- 
ured and normalized to 10 nM. The purified pooled DNA library was 
qualified by an Agilent High Sensitivity DNA Chip on a bioanalyzer. 
The quantitated library was then subjected to multiplex sequenc- 
ing (Illumina MiSeq, 251 nt x 2 pair-end reads with 12-nt index reads). 
Raw sequencing data were analysed by QIIME2. In brief, the data were 
imported into QIIME2 and demultiplexed, a DADA2 pipeline was used 
for sequencing quality control, and a feature table was constructed with 
the following options: qiime dada2 denoise-paired—i-demultiplexed- 
seqs demux.qza—o-table table—o-representative-sequences rep-seqs— 
p-trim-left-f O—p-trim-left-r O—p-trunc-len-f 150—p-trunc-len-r 150. 
The feature table of the gut microbiota was used for alpha and beta 
diversity analysis as well as for taxonomic analysis and differential 
abundance testing. 


Faecal bacterial DNA extraction and quantitative PCR analysis 
Faecal DNA was extracted from the faecal pellets of indicated mice with 
a QIAGEN QIAamp Fast DNA Stool Mini Kit (51604) according to the 
manufacturer’s instructions. Quantitative PCR analysis was performed 
with an Eppendorf Realplex? Mastercycler anda KAPA SYBR FAST gPCR 
Kit. The 16S rRNA gene primers for real-time quantitative PCR were as 
follows”: total bacteria forward: 5’-GGTGAATACGTTCCCGG-3’; total 
bacteria reverse: 5’-TACGGCTACCTTGTTACGACTT-3’; Clostridium 
cluster IV forward: 5’-CCTTCCGTGCCGSAGTTA-3’; Clostridium clus- 
ter IVreverse: 5’-GAAT TAAACCACATACTCCACTGCTT-3’; Clostridium 
cluster XIVa forward: S’-AAATGACGGTACCTGACTAA-3’; Clostridium 
cluster XIVa reverse: 5’-CTTTGAGTTTCAT TCT TGCGAA-3’. The rela- 
tive quantity of the indicated Clostridium clusters was normalized to 
the total bacteria. 


RNA-seq and microarray analysis of splenic and colonic cells 

Mouse colonic lymphocytes form Vdr*/*Foxp3™*” or Vdr!Foxp3™*"? 
reporter mice were isolated as described above. Single-cell suspen- 
sions were blocked with an antibody against CD16/32 (2.4G2) and 
stained with antibodies against BioLegend CD45 (30-F11), CD4 (GK1.5), 
TCRB (H57-597), CD8a (53-6.7), CD19 (ID3/CD19), CD11c (N418), F4/80 


(BM8) and TER-119 (TER-119) and with viability dye. Live FOXP3* T,.. 
cells (CD45*CD19°CD11c F4/80 TER-119 TCRB*CD8a CD4*mRFP’) 
were double-sorted by flow cytometry (Astrios, BD Biosciences) 
to achieve 99% purity and directly lysed with TCL buffer (Qiagen, 
1031576) containing 1% 2-mercaptoethanol (Sigma-Aldrich, M6250). 
Colonic live T,.,, cells (CD45*CD19-CD11c F4/80 TER-119- TCRB'C 
D8a CD4*mRFP ) and CD11c* dendritic cells (CD45*CD19 TCRB~ 
TER-119 F4/80°CD11c*) were double-sorted at the same time. Colonic 
epithelial cells (CD45 EpCAM*‘) were also isolated as previously 
described”. Samples lysed in TCL buffer (Qiagen, 1031576) were 
kept on ice for 5 min, frozen by dye ice and stored at —80 °C before 
processing. Smart-Seq2 libraries for low-input RNA-seq were prepared 
by the Broad Technology Labs and were subsequently sequenced 
through the Broad Genomics Platform. In brief, total RNA was extracted 
and purified by Agencourt RNAClean XP beads (Beckman Coulter, 
A66514). mRNA was polyadenylated and selected by an anchored oligo 
(dT) primer and converted to cDNA by reverse transcription. First- 
strand cDNA was amplified by PCR followed by transposon-based frag- 
mentation with the Nextera XT DNA Library Preparation Kit (Illumina, 
FC-131-1096). Samples were then amplified with barcoded primers 
(Illumina PS and P7 barcodes) by PCR. Pooled samples were subjected 
to sequencing on an Illumina NextSeq500 (2 x 25-bp reads). Transcripts 
were quantified by the Broad Technology Labs computational pipeline 
(Cuffquant version 2.2.1°*). Normalized reads were further filtered 
and analysed by Multiplot Studio in the GenePattern software pack- 
age. The colonic T,,, cell signatures (fold change > 2 and P< 0.05) and 
RORY’ T,,, cell signatures (fold change > 1.5 and P< 0.05)° were used in 
the present study. Loss of VDR functional transcripts in all VDR-deficient 
cells was confirmed by analysis of a deletion of Vdr exon 3 transcripts 
in the cells™. 

Microarray analysis of splenic and colonic T,., cells was described in 
our previous report (GEO accession: GSE68009)°, and the expression 
of VDR within these populations is shown in the present study. 


ELISA of mouse 1,25-dihydroxyvitamin D, 

SPF mice were fed either a nutrient-rich or a minimal diet, and GF mice 
were fed the nutrient-rich diet as described above. Whole blood was 
collected in aserum separation tube and allowed to clot for 2h at room 
temperature. The serum was then collected by centrifugation at 10,000 
r.p.m. for 15 min. Colonic tissues were dissected, weighed and then 
homogenized in 500 pI PBS onice. The cells were lysed by freeze (liquid 
nitrogen)-thaw (room temperature) for three cycles, and the super- 
natants were collected by centrifugation at 13,000 r.p.m. for 15 min. 
Serum and colonic levels of 1,25-dihydroxyvitamin D, in the indicated 
mice were determined with a mouse 1,25-dihydroxyvitamin D, ELISA 
kit (LifeSpan BioSciences, LS-F28132) according to the manufacturer's 
instructions. 


Immunoblot analysis 

Mouse colonic tissues were directly lysed with Triton buffer (0.5% Triton 
X-100 and 20 mMHEPES, pH 7.6) on ice, and the lysates were separated 
by NuPAGE 4-12% Bis-Tris gel (Invitrogen). Separated proteins were 
transferred onto iBlot 2 NC Mini Stacks (Invitrogen) with an iBlot 2 Dry 
Blotting System (Invitrogen). The filters were then blocked with TBS 
with 5% non-fat dry milk (Rockland) plus 0.1% Tween 20 for1 hatroom 
temperature. After blocking, the filters were incubated overnight at 4 °C 
with primary antibodies (1:1,000 dilution) against VDR (sc-13133, Santa 
Cruz), NR1H4 (sc-25309, Santa Cruz), GPBAR1 (PA5-23182, Invitrogen) 
or actin (sc-8432, Santa Cruz), and were subsequently washed three 
times with TBS (Bio-Rad) containing 0.1% Tween 20 (Bio-Rad). The 
filters were then incubated with IRDye 680LT secondary antibodies 
(1:5,000 dilution) (LI-COR Biosciences) for 1 hat room temperature in 
the dark. After three washes with TBS containing 0.1% Tween 20, the 
indicated signals were detected with an Odyssey CLx Fluorescence 
Imaging System (LI-COR). 


Statistical analysis 

Results were shown as mean +s.e.m. Differences between groups were 
evaluated by analysis of variance followed by a Bonferroni post hoc 
test or by two-tailed Student’s t-test with 95% confidence intervals. To 
determine the enrichment of certain gene signatures in RNA-seq data- 
sets, we used ax’ test. P< 0.05 was considered statistically significant. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The microarray, RNA-seq and 16S rRNA profiling data are available in 
the NCBI database under accession numbers GSE68009, GSE1374.05 and 
PRJNA573477, respectively. The MS data are available in the Metabo- 
Lights database with the identifier MTBLS1276. 
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Extended Data Fig. 1| Both dietary and microbial factors control the number 
of colonic RORy'T,,, cells. a, Beginning at 3 weeks of age, three groups of mice 
were fed special diets for 4 weeks. SPF mice were fed either anutrient-richora 
minimal diet, and GF mice were fed the nutrient-rich diet. Colonic T,,, cells were 
analysed, and absolute numbers of RORy‘Helios’ in the FOXP3°CD4TCRB'T,. 
cell population are shown. b,c, Three-week-old SPF mice were fed as ina, and 
T,eg cells in different tissues were analysed after 4 weeks. Representative plots 
(b) and frequencies of RORy‘Helios’ in the FOXP3°CD4'TCRB’T,,, cell 
population (c) are shown. iLN, inguinal lymph node. d-f, SPF mice were feda 
nutrient-rich or a minimal diet at birth and were either maintained on that diet 
or switched to the opposite diet at 3 weeks of age. Colonic T,,.. cells were 
analysed after 4 weeks. Representative plots of RORy*Helios’ inthe 


FOXP3*CD4°TCRB’T,.. cell population (d), and the frequencies of FOXP3" in the 


CD4‘TCR®* cell population (e) and RORy‘Helios’ in the colonic 
FOXP3*CD4'TCRB' T,,. cell population (f) are shown. g-i, LC-MS quantification 
of faecal acetate (g), propionate (h) and butyrate (i) from SPF mice feda 
nutrient-rich or a minimal diet, and from GF mice fed a nutrient-rich diet. 

j, Three-week-old SPF mice were fed a nutrient-rich diet, a minimal diet, ora 
minimal diet supplemented with individual or mixed SCFAs in drinking water. 
Colonic T,,, cells were analysed after 4 weeks. Frequencies of RORy Helios in 
the FOXP3°CD4°TCR6’T,,, cell population are shown. Data are representative 
of two independent experiments. n represents biologically independent 
animals. Data are mean +s.e.m. (a,c and e-j).*P< 0.05, **P<0.01,***P<0.001, 
one-way ANOVA followed by the Bonferroni post hoc test (a, e-i) or two-tailed 
Student’s f-test (c). 
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Extended Data Fig. 2| Intestinal BAs regulate the number of colonic T,.. 
cells.a, b, Absolute numbers of RORy*Helios inthe colonic 
FOXP3*CD4'TCRB' T,.. cell population (a) and of FOXP3°T,,, cells in the 
CD4'°TCR®§* population (b) in SPF mice fed a nutrient-rich diet, a minimal diet, or 
a minimal diet supplemented with mixtures of primary or secondary BAsin 
drinking water. The primary BAs were CA, CDCA and UDCA (2 mM of each). The 
secondary BAs were 3-oxo-CA, 3-0xo-LCA, 7-0xo-CA, 7-0xo-CDCA, 12-0xo-CA, 
12-0xo-DCA, DCA and LCA (1mM of each). c, Three-week-old SPF mice were fed 
anutrient-rich diet, a minimal diet, or aminimal diet supplemented with one or 
more primary or secondary BAs in drinking water. Colonic T,,17 cells were 
analysed after 4 weeks. CA, CDCA, UDCA, DCA, LCA, 3-oxo-CA, 3-0xo-LCA, 
7-0xo-CA, 7-0xo-CDCA, 12-0xo-CA, 12-0xo-DCA and the indicated BA 


combinations were tested. Frequencies of RORy*FOXP3 inthe CD4*TCRB* 
cell population are shown. d, e, Three-week-old SPF mice were fed a nutrient- 
rich diet, a minimal diet, or a minimal diet supplemented with the indicated 
primary BAs (CA/CDCA/UDCA, 2 mM of each) or the secondary BAs (oxo-BAs/ 
LCA/DCA, 1mM of each) in drinking water. T,,, cells and T,,17 cellsin the spleen, 
mesenteric lymph node and ileum were analysed after 4 weeks. Frequencies of 
RORy‘Helios in the FOXP3°CD4'TCR®' T,., cell population (d) and of 
RORy*FOXP3 inthe CD4*TCR§#* cell population (e) are shown. Data are pooled 
from two or three independent experiments. n represents biologically 
independent animals. Data are mean +s.e.m. **P< 0.01, ***P< 0.001, one-way 
ANOVA followed by the Bonferroni post hoc test. 
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Extended Data Fig. 3 | Colonic microbial profiling of rich-diet mice versus 
minimal-diet mice. a-d, Three-week-old SPF mice were fed anutrient-rich ora 
minimal diet, and the microbial compositions in the coloniclumen were 
analysed after 4 weeks by 16S rRNA sequencing. Observed operational 
taxonomic units (OTUs) (a), Shannon index (b), principal coordinates analysis 
(PCoA) (c) and the relative abundance of bacteria at the phylum and family 
levels (d) are shown. e, Quantitative PCR analysis of 16S rDNA of Clostridium 
cluster IV and Clostridium cluster XIVa in colonic luminal specimens from SPF 
mice fed a nutrient-rich diet, a minimal diet, or a minimal diet supplemented 
with the indicated primary BAs (CA/CDCA/UDCA, 2 mM of each) or the 


secondary BAs (oxo-BAs/LCA/DCA, 1mM of each) in drinking water. f, Four- 
week-old GF mice or GF mice receiving transferred faecal materials (FMTs) 
from minimal-diet or rich-diet SPF mice were fed a nutrient-rich diet ora 
minimal diet, and colonic T,,, cells were analysed after 2 weeks. Frequencies of 
colonic RORy‘Helios in the FOXP3*°CD4TCR’T,., cell population are shown. 
Data are pooled from three independent experiments ina-d. Data are 
representative of two independent experiments ineandf.nrepresents 
biologically independent animals. Data are mean +s.e.m. ***P< 0.001, one-way 
ANOVA followed by the Bonferroni post hoc test. 
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Extended Data Fig. 4 | Generation of BA metabolic pathway mutantsin 
Bacteroides. a, Schematic diagram of pNJR6 suicide vector-mediated BA gene 
deletion in Bacteroides. b, Genotyping of B. thetaiotaomicron and B. fragilis BA 
metabolic pathway mutants by PCR. PCR primers were designed to target the 
flanking regions of an intact gene. PCR of an untouched gene plus its flanking 
regions generated a PCR product of around 1,150-1,500 bp, while deletion of an 
interested BA metabolic gene resulted in only an approximately 350-450-bp 
PCRamplicon ofits two flanking regions. c, d, Bacterial load (measured as 
colony-forming unit (CFU) per gram of faeces) of B. thetaiotaomicron (c) and B. 


fragilis (d) BA metabolic pathway mutants and their wild-type control strains in 
monocolonized GF mice. e, f, LC-MS quantification of faecal conjugated 
primary BAs (e) and deconjugated primary BAs (f) in GF mice monocolonized 
with B. thetaiotaomicron or B. fragilis BA metabolic pathway mutants and their 
wild-type control strains. Data are representative of two independent 
experiments in b, eand f. Data are pooled from three independent experiments 
incandd.nrepresents biologically independent animals. Data are 

mean +s.e.m. (c-f). 
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Extended Data Fig. 5 | Gut bacteria modulate colonic RORy'T,,, cells via their 
BA metabolic pathways. a, b, Each of four groups of GF mice was colonized 
with one of the following microorganisms: (1) a wild-type strain of B. 
thetaiotaomicron; (2) aBSH-mutant strain; (3) a 7a-HSDH-mutant strain; or (4) a 
triple-mutant (TKO) strain. Colonic T,,, cells were analysed after 2 weeks. 
Absolute numbers of RORy‘Helios in the colonic FOXP3*CD4TCRB"T,,, cell 
population (a) and of FOXP3'T,,, cells in the CD4°*TCRB* population (b) are 
shown. c,d, Each of four groups of GF mice was colonized with one of the 
following microorganisms: (1) a wild-type strain of B. fragilis; (2)aBSH-KO 
strain; (3) a 7a-HSDH-KO strain; or (4) adouble-mutant (DKO) strain. Absolute 
numbers of colonic T,,, cells are shown as inaand b.e, f, GF mice were colonized 
with B. thetaiotaomicron BA metabolic pathway mutants or their wild-type 
control strains. Colonic T,,, cells and T,,17 cells were analysed after 2 weeks. 
Frequencies of FOXP3* inthe CD4‘TCR$* cell population (e) or RORy*FOXP3 in 
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the CD4*TCR§* cell population (f) are shown. g, h, GF mice were colonized with 
B. fragilis BA metabolic pathway mutants or their wild-type control strains. 
Frequencies of colonic T,,, cells and T,,17 cells are shown asineandf. 

i,j, GF mice were colonized with B. thetaiotaomicron BA metabolic pathway 
mutants or their wild-type control strains. T,., cells and T,,17 cellsin the spleen, 
mesenteric lymph node and ileum were analysed after 2 weeks. Frequencies of 
RORy‘Helios’ in the FOXP3*CD4*TCRB’T,,, cell population (i) and 
RORy*FOXP3 inthe CD4‘TCRB* cell population (j) are shown. k, I, GF mice were 
colonized with B. fragilis BA metabolic pathway mutants or their wild-type 
control strains. Frequencies of T,., cells and T,,17 cells in the spleen, mesenteric 
lymph node and ileum are shownas iniandj. Data are pooled from two or three 
independent experiments. n represents biologically independent animals. 
Data are mean +s.e.m. *P< 0.05, **P<0.01, ***P< 0.001, one-way ANOVA 
followed by the Bonferroni post hoc test. 
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Extended Data Fig. 6 | The effect of BAR deficiency onT,,, cells or T,17 cellsin 
gut and peripheral lymphoid organs. a, Protein expression of VDR, FXR (also 
known as NR1H4) and GPBAR1 in the colonic tissue of SPF C57BL/6J mice was 
analysed by western blot. The red asterisks indicate the corresponding 
molecular weight of VDR (53 kDa), FXR (69 kDa) and GPBAR1 (33 kDa). For gel 
source data, see Supplementary Fig. 1. b,c, Absolute numbers of RORy Helios” 
in the colonic FOXP3*°CD4'TCR®’ T,-, cell population (b) and of FOXP3’T,., cells 
in the CD4‘TCRB* population (c) from mice deficient in nuclear receptors 

~ Vdr', Nrih4” and Vdr/-Nrih4") and their littermate 
controls. d, e, Frequencies of FOXP3’ in the colonic CD4°*TCRB* cell population 
from mice deficient in G-protein-coupled receptors (Gpbarl”, Chrm2‘, 
Chrm3™ and SIpr2”) and their littermate controls (d) and from mice deficient 


(Nrli2’ Nrli3™, Nr1h3™ 


innuclear receptors (Nrli2/-Nrli3”, Nrih3, Vdr’, Nrih4 and Vdr’ Nrih4") 
and their littermate controls (e). f, g, Frequencies of RORy*FOXP3 inthe 
colonic CD4‘TCR$* cell population from mice described ind ande. h-n, T,., 
cells inthe spleen, mesenteric lymph node and ileum from the indicated mice 
were analysed. Frequencies of RORy‘Helios’ inthe FOXP3*CD4*TCRB* 
population from Gpbari™ (h), Chrm2~ (i), Chrm3~ (j), SIpr2“" (k), 
Nrli2’-Nri3™ (1), Nr1h3~ (m), and Vdr“, Nrih4 and Vdr/-Nrih4~ (n) mice and 
their littermate controls are shown. Data are representative of two or three 
independent experiments ina and d-n, or are pooled from two or three 
independent experiments inbandc. nrepresents biologically independent 
animals. Data are mean + s.e.m. **P< 0.01, one-way ANOVA followed by the 
Bonferroni post hoc test. 
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Extended Data Fig. 7 | Dietary vitamin D, does not alter the frequency of 
colonic RORy'T,,, cells. a, b, Beginning at 3 weeks of age, three groups of mice 
were fed special diets for 4 weeks. SPF mice were fed either anutrient-richora 
minimal diet, and GF mice were fed the nutrient-rich diet. The levels of 


FOXP3*CD4“TCRB’T,., cell population are shown. d, SPF mice were feda 
nutrient-rich diet at birth and were either maintained on that diet or switched 
toarich diet deficient in vitamin D, or vitamin A at 3 weeks of age. Colonic Tye. 
cells were analysed after 4 weeks. Frequencies of RORy*Helios inthe colonic 
1,25-dihydroxyvitamin D, in serum (a) and the colon (b) of these mice were FOXP3*CD4'TCRB' T,,, cell population are shown. Data are representative of 
determined by ELISA. c, SPF mice were fed a nutrient-rich diet, or arich diet two independent experiments. n represents biologically independent animals. 
deficient in vitamin D, (VitD3) or vitamin A (VitA) at birth. ColonicT,,,cellswere | Dataaremeanés.e.m.***P<0.001, one-way ANOVA followed by the Bonferroni 
analysed after 7 weeks. Frequencies of RORy‘Helios inthe colonic post hoctest. 
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Extended Data Fig. 8 | Comparison of RORy 'T,,, cell signature genes of 
colonicT,,, cells from Vdr'" and Vdr“ mice. Volcano plots comparing 
transcriptomes of colonic T,,, cells from Vdr*Foxp3™” and Vdr’ Foxp3™"” 
mice (n=3). Colonic RORy’T,., cell signature genes are highlighted in red 
(upregulated) or blue (downregulated). The number of genes from each 
signature preferentially expressed by one or the other population is shown at 
the bottom. Data are pooled from two independent experiments. n represents 
biologically independent animals. To determine the enrichment of certain 
gene signatures in RNA-seq datasets, ax’ test was used. P< 0.05 was considered 


statistically significant. 
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Extended Data Fig. 9 | BAsupplementation does not cause gut inflammation 
and cannot ameliorate gut inflammation after the development of colitis. 
a-c, Three-week-old SPF mice were fed a nutrient-rich diet, aminimal diet, ora 
minimal diet supplemented with mixtures of primary BAs (CA/CDCA/UDCA, 2 
mM ofeach) or secondary BAs (oxo-BAs/LCA/DCA, 1mM of each) in drinking 
water. Initial body weights were recorded before DSS challenge (a). Clinical 
scores (b) and haematoxylin and eosin histology (c) for representative colons 
from mice not challenged with DSS are shown. d, e, Three-week-old SPF mice 
feda nutrient-rich or a minimal diet for 4 weeks were then challenged inthe 
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DSS-induced colitis model. After the development of colitis at day 5 of the 
model, the DSS containing drinking water was switched to regular drinking 
water or to drinking water supplemented with mixtures of primary or 
secondary BAs. The primary BAs were CA, CDCA and UDCA (2 mM of each). The 
secondary BAs were 3-0xo-CA, 3-0xo-LCA, 7-oxo-CA, 7-0xo-CDCA, 12-0xo-CA, 
12-0xo-DCA, DCA and LCA (1mM of each). Daily weight loss (d) of mice during 
the course of DSS-induced colitis and clinical scores (e) on day 10 of colitis are 
shown. Data are representative of two independent experiments. n represents 
biologically independent animals. Data are mean+s.e.m.ina,b,dande. 
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Extended Data Fig. 10| VDR signalling controls gut inflammation. a—c, Daily 


weight loss (a) of Vdr** and Vdr“ mice during the course of DSS-induced colitis. 


Clinical scores (b) and haematoxylin and eosin histology (c) of representative 
colons on day 10 of colitis are shown. d, Schematic representation of the T cell- 
adaptive transfer model of colitis. Either Vdr*"* or Vdr* naive T cells are 
transferred to Rag1” mice. e-g, Weight loss (e) of RagI’ mice ind during the 


Var*!* Naive T to Ragi“ Vdr* Naive T to Rag1™ 


course of T cell-adaptive transfer-induced colitis. Clinical scores (f) and 
haematoxylin and eosin histology (g) of representative colons on day 67 of 
colitis are shown. Data are representative of two independent experiments. 
nrepresents biologically independent animals. Dataare mean+s.e.m. ina,b,e 
and f.*P<0.05, **P< 0.01, ***P< 0.001, two-way ANOVA followed by the 
Bonferroni post hoc test (aande) or two-tailed Student’s ¢-test (band f). 
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Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


4) A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


CO) Uo 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection MACSQuantify™ Software was used to collect data from flow cytometry; Eppendorf MasterCycler was used to collect real-time PCR data; 
Thermo Scientific Orbitrap Q Exactive with Thermo Vanquish UPLC (UPLC-MS/MS) was used to collect MS data; Odyssey® CLx 
Fluorescence Imaging System was used to collect Immunoblot images; Zeiss Light Microscope was used to collect histological data; 
Illumina MiSeq was used to collect 16S rRNA profiling data; Illumina NextSeq500 was used to collect RNAseq data; BioTek® microplate 
reader was used to collect ELISA data. 


Data analysis Flow cytometric analyses were performed with the FlowJo 10.4.1; Thermo Xcalibur suite software version 3.0 was used to analysis the 
bile acid and SCFA data generated by UPLC-MS/MS (Thermo Scientific Orbitrap Q Exactive with Thermo Vanquish UPLC); 16S rRNA 
profiling data were analyzed by QIIME2; RNAseq data were analyzed by Cuffquant 2.2.1 and Multiplot Studio 1.5.60 in the GenePattern 
software package; ELISA data were obtained by BioTek® GenS software; Statistical analyses were performed with the GraphPad Prism 7.0 
and Excel 2016. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The authors declare that the data supporting the findings of this study are available within the paper and its supplementary information files. Accession codes will 
be available along with publication. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes were based on a combination of the resource equation method, limited availability of gnotobiotic mice, and prior lab experience 
to ensure statistical and biological significance. 


Data exclusions No data was excluded from the analysis. Occasional growth restriction in mice fed a minimal diet (AIN-76A) necessitated their exclusion from 
all experiments. The exclusion criteria for growth restricted mice were pre-established. 


Replication Replicate experiments were preformed successfully at least two to three times. The number of replicates for each experiment is listed in the 
figure legends. If group size was small (due to limited availability of mouse strains), data from replicate experiments were pooled for graphical 
representation. 


Randomization Mice were randomized between groups when feasible. Age- and sex-matched littermates from each mouse line were bred as strict controls. 


Blinding Investigators were not blinded to the experimental groups given different housing conditions required. Clinical scores of hematoxylin/eosin 
stained colonic tissue sections were determined by a pathologist who was blinded to the nature of the mice under analysis. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
[| Palaeontology |_| MRI-based neuroimaging 


Animals and other organisms 


[| Human research participants 


Clinical data 


Antibodies 


Antibodies used Following FACS antibodies were used in the study:PB-CD45 (30-F11, Biolegend, 103126), PE/Cy7-CD45 (30-F11, Biolegend, 
103114), BV605-CD4 (GK1.5, Biolegend, 100451), PerCP/Cy5.5-CD4 (GK1.5, Biolegend, 100434), PE/Cy7-TCR-B (H57-597, 
Biolegend, 109222), FITC-TCR-B (H57-597, Biolegend, 109206), FITC-Helios (22F6, Biolegend, 137214), PE-RORy (AFKJS-9, 
eBioscience™, 50-112-9700), APC-Foxp3 (FJK-16s, eBioscience™, 50-112-8936), APC/Cy7-Cd11c (N418, Biolegend, 117324), APC- 
CD8a (53-6.7, Biolegend, 100712), APC-CD19 (1D3/CD19, Biolegend, 152410), APC-F4/80 (BM8, Biolegend, 123116), APC- 
TER-119 (TER-119, Biolegend, 116212), PE-EpCAM (G8.8, Biolegend, 118206), PE-CD45RB (C363-16A, Biolegend, 103308), APC- 
CD25 (3C7, Biolegend, 101910), and anti-CD16/32 (2.4G2,BD Pharmingen™, 553142). 
Following WB antibodies were used in the study: Anti-VDR (sc-13133, Santa Cruz), Anti-FXR (sc-25309, Santa Cruz), Anti- 
GPBAR1 (PA5S-23182, Invitrogen), Anti-Actin (sc-8432, Santa Cruz), IRDye® 680LT Goat anti-Rabbit IgG Secondary Antibody (P/N: 
926-68021, LI-COR Biosciences), and IRDye® 680LT Goat anti-Mouse IgG Secondary Antibody (P/N: 926-68020, LI-COR 
Biosciences). 


Validation All the antibodies used in this study were validated by manufactures and used according to supplied instructions. The WB 
antibodies were re-validated by immunoblot performed in this study, and the FACS antibodies were determined by testing them 
with splenocyte staining. 
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Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting anima 


Note that full information on the approval o 


Laboratory animals 


Wild animals 
Field-collected samples 


Ethics oversight 


Flow Cytometry 


C57BL/6J wild-type mice were obtained from Jackson, as were Vdr—/-, Nr1h4—/-, 


research 


r1h3-/—, Chrm2-/-, Chrm3-/-, Rag1-/-, 


Foxp3YFP-cre, Cdiiccre, Villcre, and Foxp3mRFP mice. Nr1i2—/—Nr1i3—/— mice were obtained from Taconic. Gpbar1—/— mice 
were from KOMP Repository. S1pr2—/— mice were kindly provided by Dr. Timothy Hla at Boston Children's Hospital. Vdrflox/flox 
mice were kindly provided by Dr. David Gardner at University of California, San Francisco, and then crossed with Foxp3YFP-cre, 
Cd11ccre, or Villcre mice to generate the corresponding cell type specific knockout mice. Vdr—/—Nrih4—/— mice were obtained 
by crossing Vdr+/— with Nrih4+/— mice. Vdr—/-Foxp3mRFP or Vdr+/+Foxp3mRFP reporter mice were generated by crossing Vdr 


+/— with Foxp3mRFP reporter mice. All genetically modified mice and their contro 


mice were subjected to experiments at the 


ages of 6-8 weeks old. For dietary treatment experiments, 3-week-old C57BL/6) wild-type mice or genetically deficient mice 
were fed either a sterilized nutrient-rich diet (LabDiet 5K67) or a minimal diet (TestDiet AIN-76A) for 4 weeks. Some groups of 


the mice fed a minima 


diet were also treated with various BAs or SCFAs (sodium salt form) in drinking water for 4 weeks. GF 


C57BL/6J mice were orally inoculated by gavage with a broth-grown single bacterial strain or a collection of fecal material at 4 


weeks o 


The study did not involve wil 


d animals. 


age. Each group of mice was then maintained in a gnotobiotic isolator under sterile conditions for 2 weeks. 


The study did not involve samples collected from the field. 


Harvard Medical School Institutional Animal Care and Use Committee and the Committee on Microbiological Safety. 


the study protocol must also be provided in the manuscript. 
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Gating strategy 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


All plots are contour plots with outliers or pseudocolor plots. 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


A numerical value for number of cells or percentage (with statistics) is provided. 


For isolation of LP lymphocytes, colonic and small intestinal tissues were dissected and fatty portions discarded. PPs were also 
removed from small intestines for isolation of lymphocytes. The excised intestinal tissues were washed in cold PBS buffer, and 
epithelia were removed by 500-rpm stirring at 37°C in RPMI medium containing 1 mM EDTA, 1 mM DTT, and 2% (vol/vol) FBS. 


. 


tS 


FlowJo 10.4.1. 
All the cells were double sor 


The gating strategies used in 
1. FSC-A/SSC-A---FSC-A/FSC- 
Helios-FITC. 

2. FSC-A/SSC-A---FSC-A/FSC- 
3. FSC-A/SSC-A---FSC-A/FSC- 
4. FSC-A/SSC-A---FSC-A/FSC- 
5. FSC-A/SSC-A---FSC-A/FSC- 
high. 


ed by flow cytometry 


this study: 
H--- CD45-PB/Viability 


H--- CD45-PE/Cy7/Viab 
H--- CD45-PE/Cy7/Viab 


H--- CD45-PB/Viability 


After 15 min of incubation, the epithelium-containing supernatants were discarded, and the remaining intestinal tissues were 
washed in RPMI medium with 5% (vol/vol) FBS, further minced into small pieces, and digested by 500-rpm stirring at 37°C in 
RPMI medium containing collagenase type II (1.5 mg/ml), Dispase II (0.5 mg/ml), and 1.2% (vol/vol) FBS for 40 min. The digested 
issues were filtered, and the solutions were centrifuged at 500 g for 10 min in order to collect LP cells. The pellets were 
resuspended, and the LP lymphocytes were isolated by Percoll (40%/80%) gradient centrifugation. For isolation of PP 
mphocytes, the excised PPs were digested in the same medium by 500-rpm stirring at 37°C for 10 min, the digested PPs were 
tered, the solutions were centrifuged at 500 g for 10 min, and lymphocytes were collected. Lymph nodes, spleens, and 
thymuses were mechanically disrupted. 


iltenyi MACSQuant Analyzer and BD Biosciences MoFlo Astrios EQ. 


Astrios, BD Biosciences) to achieve 99% purity. 


Dye-APC/Cy7---TCR-B-PE/Cy7/CD4-PerCP/Cy5.5---Foxp3-APC---RORy-PE/ 


ility Dye-DAPI---TCR-B-FITC/CD4-BV605---Foxp3-mRFP+/-. 
ility Dye-DAPI---Cd11c-APC/Cy7. 


H--- CD45-PB/Cy7-/Viability Dye-APC/Cy7---EpCAM-PE. 


Dye-AmCycan---TCR-B-PE/Cy7/CD4-PerCP/Cy5.5---CD25-APC-/CD45RB-PE 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Humoral immune responses to immunization and infection and susceptibilities to 


antibody-mediated autoimmunity are generally lower in males’ *. However, the 
mechanisms underlying such sexual dimorphism are not well understood. Here we 
show that there are intrinsic differences between the B cells that produce germinal 
centres in male and female mice. We find that antigen-activated male B cells do not 
position themselves as efficiently as female B cells in the centre of follicles in 
secondary lymphoid organs, in which germinal centres normally develop. Moreover, 
GPR174—an X-chromosome-encoded G-protein-coupled receptor—suppresses the 
formation of germinal centres in male, but not female, mice. This effect is intrinsic toB 
cells, and correlates with the GPR174-enhanced positioning of B cells towards the 
T-cell-B-cell border of follicles, and the distraction of male, but not female, B cells 
from S1PR2-driven follicle-centre localization. Biochemical fractionation of 
conditioned media that induce B-cell migration in a GPR174-dependent manner 
identifies CCL21 as a GPR174 ligand. In response to CCL21, GPR174 triggers acalcium 
flux and preferentially induces the migration of male B cells; GPR174 also becomes 
associated with more Gai protein in male than in female B cells. Male B cells from 
orchidectomized mice exhibit impaired GPR174-mediated migration to CCL21, and 
testosterone treatment rescues this defect. Female B cells from testosterone-treated 
mice exhibit male-like GPR174—Gai association and GPR174-mediated migration. 
Deleting GPR174 from male B cells causes more efficient positioning towards the 
follicular centre, the formation of more germinal centres and an increased 
susceptibility to B-cell-dependent experimental autoimmune encephalomyelitis. By 
identifying GPR174 as a receptor for CCL21 and demonstrating its sex-dependent 
control of B-cell positioning and participation in germinal centres, we have revealed a 
mechanism by which B-cell physiology is fine-tuned to impart sexual dimorphism to 


humoral immunity. 


We began by comparing the formation of germinal centres by specific 
Bcells from male and female MD4 mice. These particular B cells recog- 
nize hen egg lysozyme (HEL). We transferred MD4 B cells into the same 
male host mice together with OT-II helper T cells, which recognize the 
major histocompatibility complex (MHC) molecule I-A” in complex 
with chicken ovalbumin-derived peptide (OVA,,;_339). Five days after 
immunization with HEL-OVA, female MD4 B cells madea substantially 
larger contribution to germinal centres than male cells (Extended Data 
Fig. 1a, b). Interestingly, at day 4 before visible germinal-centre for- 
mation, female MD4 cells were more concentrated in the follicular 
centre occupied by the follicular dendritic cell network, whereas male 
cells inthe same follicle were more dispersed across the entire follicle 


(Extended Data Fig. Ic, d), suggesting that differential follicle-centre 
localization may underlie the differential germinal-centre formation 
by male and female B cells. 

Aseries of guidance receptors—including the chemokine recep- 
tors CXCRS (refs. *°) and CCR7 (refs. ®”), the sphingosine-1-phosphate 
receptor 2 (SIPR2; ref. 8) and the G-protein-coupled receptor GPR183 
(refs. °!°)—orchestrates the sequential localization of antigen-activated 
Bcells during a germinal-centre response. However, we did not detect 
differences inthe expression of these receptors between the two sexes 
(Extended Data Fig. 2a). In a survey of additional G-protein-coupled 
receptors (GPCRs), we found that X-linked GPR174 was expressed 
by naive and germinal-centre B cells (Extended Data Fig. 2b). When 
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Gprt74"/-S1PR2 
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Dispersal index = 
Cell density in CD35* area 


Fig. 1|GPR174 regulates B-cell positioning and suppresses germinal-centre 
formation in male, but not female, mice. a, Splenic distribution patterns of 
mouse B cells retrovirally transduced with vector that expresses GFP alone or 
in combination with GPR174, 24 h after adoptive transfer. B220, the follicle; 
CD3, the T-cell zone. b, Summary statistics of the relative distance to the T-cell 
zone, defined as the ratio between the shortest distance from GFP’ Bcells to 
the T-zone edge (yellow dashed lines ina) and the maximum depth of the follicle 
(white lines), with negative values indicating T-zone locations. Each symbol 
denotes one B cell (n=218 and 232), with data pooled from three experiments. 
Scale bar, 50 um. c, Germinal-centre (FAS"'GL7") frequencies of transduced 
MD4 B cells five days after immunization. Each symbol denotes one mouse 
(n=8 mice), with data pooled from two experiments. d,e, Statistics for 
germinal centres (d) and spleen plasma cells (SPPCs) (e) in mice of the indicated 
genotypes and sexes seven days after SRBC immunization. Blue and cyan 
symbols, males; red and pink symbols, females. Each symbol denotes one 
mouse (n=14, 15,16 and 15 from left to right). Data pooled from four 
experiments. f, SRBC-specific IgG titres in sera from mice of the indicated 
genotypes and sexes 14 days after SRBC immunization. Each symbol denotes 
one mouse (n=12, 12, 13 and 13); data pooled from two experiments. 


retrovirally overexpressed, GPR174 promoted B-celllocalizationtowards 
the T-cell-B-cell border of secondary lymphoid organs (Fig. 1a, b) 
without increasing CCR7 expression (Extended Data Fig. 2c), and mark- 
edly inhibited germinal-centre formation (Fig. lc and Extended Data 
Fig. 3a). To test whether GPR174 differentially regulates germinal-centre 
responses in the two sexes, we examined germinal-centre formationin 
GPR174-deficient male and female mice. Seven days after immuniza- 
tion with sheep red blood cells (SRBCs), the size of germinal centres 
was lower in males than in females; GPR174 deficiency led toa marked 
recovery of the germinal-centre size in males but did not affect females 
(Fig. 1d and Extended Data Fig. 3b). Males had lower levels of splenic 
plasma cells and anti-SRBC immunoglobulin G (IgG) titres, which 
also recovered substantially in the absence of GPR174 (Fig. le, f and 
Extended Data Fig. 3c). Ina model of acute viral infection, GPR174 also 
differentially impinged on germinal-centre formation in different sexes 
(Extended Data Fig. 3d). Next, we constructed mixed bone-marrow 
chimaeras comprising 80% of bone-marrow cells from LMT mice (which 
cannot generate B cells) and 20% of wild-type or GPR174-deficient donor 
bone-marrow cells. One week after SRBC immunization, germinal- 
centre formation by Gpr174”’ B cells was higher than that by Gpr174"”" 
counterparts in male chimaeras, whereas female Gpr174”" and Gpr174" 
B cells did not differ in female chimaeras (Fig. 1g and Extended Data 
Fig. 3e). Splenic plasma cells and anti-SRBC titres followed the same 
trend (Fig. 1h, iand Extended Data Fig. 3f). Thus, GPR174 suppresses 


Gprt74-Y-S1PR2 
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g,h, Statistics of germinal centres (g) and SPPCs (h) in mixed bone-marrow 
chimeric mice seven days after SRBC immunization. Each symbol denotes one 
mouse (n=18, 19,18 and 18), with data pooled from four experiments. i, SRBC- 
specific IgG titres in sera of mixed bone-marrow chimaeras 14 days after SRBC 
immunization. Each symbol denotes one mouse (n=12,12,12 and 10), with data 
pooled from two experiments. j, Distribution patterns of B cells of the 
indicated genotypes and sexes, transduced to express control vector (top) or 
S1PR2 (bottom), 12 hafter transfer into sex-matched Bé6 recipients. Yellow lines 
outline follicle borders; white dashed lines outline CD35" follicular dendritic 
cell regions in each follicle; yellow circles highlight transferred B cells inthe 
IgD* CD35" area; white circles highlight transferred B cells inthe CD35‘ region. 
Scale bar, 50 um. k, Definition of the dispersal index. I,m, Summary statistics 
of dispersal indices for male (I) and female (m) cells. Each symbol represents 
one follicle (n=55, 58, 44, 48 biological replicates for males; n=58,49, 46,49 
for females). Data were pooled from two independent experiments. 

b-e, g-h, I-m, Short horizontal lines indicate mean. Two-tailed unpaired 
Student’s t-tests were used. f, i, Column height indicates median. Two-tailed 
Mann-Whitney U-tests were used. Pvalues are given in graphs; ****P< 0.0001. 


the intrinsic ability of male, but not female, B cells to form germinal 
centres and produce plasma cells. 

We observed no difference in the follicular distribution of naive B 
cells sufficient or deficient in GPR174, and no difference in localization 
to the T-cell-B-cell border following acute activation? (Extended Data 
Fig. 4). Upregulation of S1PR2 on activated B cells promotes follicle- 
centre localization before germinal-centre formation”, a process that 
can be mimicked with SIPR2 overexpression®. To examine whether 
endogenous GPR174 regulates this process in asex-dependent manner, 
we transduced S1PR2 into male and female B cells. Following transfer 
into B6 mice, wild-type male and female B cells concentrated towards 
the follicular centre comparably; by contrast, this process was exagger- 
ated by GPR174 deficiency for male, but not female, B cells (exemplified 
in Fig. 1j and quantified with a dispersal index in Fig. 1k-m). These data 
suggest that in male, but not female, mice endogenous GPR174 guides 
activated B cells away from the follicular centre, and thereby inhibits 
germinal-centre formation. 

Previous studies have suggested that lysophosphoserine (LysoPS) is 
a GPR174 ligand”, and that GPR174 plays a part in regulating the func- 
tions of regulatory T cells®. However, we found that LysoPS does not 
induce GPR174-dependent migration, whereas culture medium condi- 
tioned by acrude preparation of splenic stromal elements containeda 
strong chemoattractant activity for GPR174, irreplaceable by LysoPS 
(Extended Data Fig. 5a, b). Because this chemoattractant activity was 
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Fig. 2 | Sexually dimorphic GPR174-mediated B-cell migration to CCL21. 
a-d, Transwell migration in response to the indicated CCL21 concentrations by 
naive B cells (a, c) or B cells that were stimulated with anti-IgM (10 pg ml”) and 
anti-CD40 (10 pg mI) antibodies for 48 h (b, d), from GPR174-sufficient or 
-deficient littermates that were either unmanipulated (a, b) or 
gonadectomized, six weeks after surgery (c,d). The dashed line at 40% aids 
cross-panel comparison. ORX, orchidectomized; OVX, ovariectomized. Data 
represent three biological replicates for each condition from one of three 
independent experiments with similar results. Blue asterisks, wild-type versus 
GPR174-knockout male; red asterisks, wild-type versus GPR174-knockout 
female; green asterisks, wild-type male versus wild-type female. e, f, Summary 
statistics from the three experiments, showing GPR174-dependent migration 
of naive B cells (e) or activated B cells (f) in response to CCL21, calculated by 
subtracting the mean percentage migrated of GPR174-knockout B cells from 
the mean percentage migrated of corresponding wild-type B cells. Mean values 
from individual experiments are indicated with different symbols, and bar 
heights indicate averages of the three experiments. Blue asterisks, male versus 
male ORX; red asterisks, female versus female OVX; green asterisks, male 
versus female. g, Transwell migration of activated B cells that were isolated 
from GPR174-sufficient or -deficient male or GPR174-sufficient female 
littermate mice treated with either vehicle or testosterone. Data represent 
three biological replicates for each condition, from two independent 
experiments. Blue asterisks, male wild type versus knockout plus vehicle; red 
asterisks, female wild type vehicle versus testosterone. All statistical 
comparisons were made by two-way analysis of variance (ANOVA) with 
Bonferroni’s multiple comparisons; ****P< 0.0001. NS, not significant. 


lost after heating at 100 °C or incubation with proteinase K (Extended 
Data Fig. 5c), it is likely that GPR174 sensed a protein. GPR174 is highly 
conserved in mammalian species, and we confirmed that the mouse 
receptor responded to ligands from rat (data not shown) and porcine 
(Extended Data Fig. 5d) spleens. Subsequently, through a 6-step iso- 
lation procedure (Extended Data Fig. 5e), we concentrated from 16 
porcine spleens sufficient amounts of ligand activities for mass-spec- 
trometry-based protein identification (Extended Data Fig. 5f-k). Frac- 
tion5 from the final mono-S purification step contained the strongest 
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Fig. 3 | Sex and hormone dependence of CCL21-induced GPR174-Gai 
association. a, Left and centre, immunoblots (IBs) of lysates (left) or GFP 
immunoprecipitates (centre) of B cells isolated from male or female GPR174- 
GFP BAC transgenic mice, activated with anti-IgM and anti-CD40, and left 
untreated or stimulated with 300 ng mI‘ CCL21. Right, normalized quantities 
of indicated Ga proteins in co-immunoprecipitation (IP) products from three 
similar experiments, each represented by a different colour and a connecting 
line. b, Left and centre, immunoblots of lysates or GFPimmunoprecipitates of B 
cells from the indicated sources and treatments, as ina. Right, normalized 
quantities of indicated Ga proteins inco-immunoprecipitation products from 
two experiments, each represented by a different colour and aconnecting line. 
Gai-1 was probed in one of the two experiments. Vehicle, mice treated with 
sunflower seed oil; testosterone, mice treated with testosterone dissolved in 
sunflower seed oil. For gel source data, see Supplementary Fig. 1. 


chemoattractant activity and gave rise to bands with enhanced intensi- 
ties on silver-stained gels (Extended Data Fig. 5k-I). By mass spectrom- 
etry, we identified numerous peptides that collectively cover 50% of 
the porcine CCL21 sequence. 

To verify that CCL21 and potentially CCL19 are chemokine ligands 
for GPR174, we neutralized CCL21 and CCL19 in splenic-stroma-condi- 
tioned medium. We found that blocking either CCL21 or CCL19 reduced, 
and blocking both abrogated, the migration of GPR174-transduced B 
cells to stroma-conditioned medium (Extended Data Fig. 6a). GPR174- 
transduced B cells also vigorously migrated towards recombinant 
CCL21 or CCL19 (Extended Data Fig. 6b). GPR174 or CCR7 transfection 
of 293T cells—which do not endogenously express these proteins— 
conferred a comparable binding capacity to CCL21 (Extended Data 
Fig. 6c). Finally, GPR174- or CCR7-expressing 293T cells responded 
to CCL21 with calcium fluxes in a dose-dependent manner (Extended 
Data Fig. 6d, e). The estimated half maximal effective concentration 
(EC;9) for CCL21 to trigger either GPR174 or CCR7 was around 15 nM 
(with 95% confidence intervals of 9-23 nM for GPR174 and 8-28 nM 
for CCR7). Therefore, similarly to CCR7, GPR174 is a binding and sig- 
nalling receptor for CCL21. This is consistent with the fact that CCL21 
is produced by stromal cells in the T-cell zone, and that GPR174 
overexpression led B cells to position towards the T-cell-B-cell border 
(Fig. 1a, b). 

To test whether endogenous GPR174 contributes to CCL21-directed 
B-cell migration differently between the two sexes, we first examined 
naive B cells. As shown in Fig. 2a, naive B cells migrated to CCL21 weakly 
in general, with barely 20% migrating at CCL21 concentrations of 
1,000 ng mI“. GPR174 deficiency did not have any effect on female naive 
B cells (compare the red and pink titration curves in Fig. 2a); for male 
naive cells, however, GPR174 deficiency led toa slight but discernible 
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Fig. 4| B-cell GPR174 reduces EAE susceptibility and autoantibody titresin 
male mice. a, EAE disease scores of 12-week-old mixed bone-marrow chimaeras 
of the indicated sexes, constructed with 80% MT and 20% GPR174-sufficient 
or -deficient bone-marrow cells. Data (n=10 mice each group) are from one of 
two independent experiments with similar results. b, Anti-MOG IgG titres in the 
indicated chimaera on day 30 after EAE induction. Data (n=16 mice each 


reduction in migration to 1,000 ng ml“ CCL21 (wild type, 20.5 + 1.0% 
migrated; knockout, 14.5 + 1.3%; P< 0.0001). 

Next, we examined activated B cells. As shown in Fig. 2b, activated 
male B cells migrated more strongly than female B cells towards CCL21, 
reaching twice as high a percentage migrated as the female cells at 
all CCL21 concentrations (compare the blue and red curves). GPR174 
deficiency led to a roughly 50% reduction in migration by male cells 
(blue and cyan curves), down to a level comparable to that seen with 
wild-type female cells (the cyan and red curves); GPR174 deficiency in 
female activated B cells also led to a reduction, but of a smaller mag- 
nitude (compare the red and pink curves). 

To test whether sex hormones are responsible, we further examined 
Bcells from gonadectomized male and female mice. GPR174-deficient 
naive B cells isolated from orchidectomized mice actually migrated 
moreinresponseto 1,000 ng mI‘ CCL21 (Fig. 2c) (wild type, 12.0 + 0.9%; 
knockout, 16.8 + 1.8%; P= 0.0017). GPR174-deficient female naive B 
cells also migrated more than wild-type cells following ovariectomy 
(compare the red and pink curves in Fig. 2a, c). For activated male B 
cells, isolated from orchidectomized mice, CCL21-induced migration 
was down to a level similar to that of female B cells isolated from normal 
mice (compare the blue curve in Fig. 2d with the red curve in Fig. 2b) 
and, notably, toa level similar to that of GPR174-deficient B cells from 
normal or orchidectomized mice (compare the blue curve in Fig. 2d to 
the cyancurves in Fig. 2b, d). Thus, orchidectomy substantially reduced 
GPR174-mediated migration to CCL21 in male activated B cells. Onthe 
other hand, ovariectomy did not change the comparatively smaller 
effect of GPR174 on female B cells (red and pink curves in Fig. 2b, d). 

The data presented in Fig. 2a—d are from one experiment. Summary 
statistics from three such independent experiments are presented 
in Fig. 2e, f, in which the migrated fraction in each GPR174-deficient 
group is subtracted from that in the corresponding wild-type group 
to quantify GPR174-dependent migration. For naive B cells, GPR174 
mediates a small response only to high concentrations of CCL21 in 
the male, but this effect is absent in female cells or in male cells from 
orchidectomized mice (Fig. 2e). For activated B cells, GPR174 medi- 
ates much stronger migration (moreso in males than in females), and 
orchidectomy abrogates the margin by which male outperform female 
cells (Fig. 2f). 

Although testosterone treatment in vitro for up to two days did not 
alter B-cell migration to CCL21 (data not shown), two-week testoster- 
one treatment of orchidectomized mice rescued their B-cell migra- 
tion to CCL21 (Extended Data Fig. 7a). Furthermore, B cells isolated 
from testosterone-treated female mice became much more efficient in 
migrating to CCL21 (Fig. 2g). We did not detect any difference in GPR174 
or CCR7 expression by B cells from male, female, gonadectomized or 
testosterone-treated mice (Extended Data Fig. 7b-g). Transcriptomic 
analyses of B cells from sham-operated or gonadectomized male or 
female mice revealed no differential expression of known guidance 
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graph. 


receptors (R.Z. et al., unpublished observations). GPR174 expression 
did not differ between male and female B cells during an active response 
in vivo (Extended Data Fig. 7h). Therefore, the male hormone condi- 
tions GPR174-mediated B-cell migration to CCL21, although receptor 
expression is not the point of regulation. 

GPCR-mediated chemotaxis depends on coupling with Gai pro- 
teins, and coupling to Gal2/13 proteins inhibits directional migration 
towards ligands. The same GPCR can couple to different Ga proteins 
in different cell types or in the same cell type of different sexes, as 
in the case of the corticotropin-releasing-hormone receptor". To 
test whether GPRI174 is differentially coupled to Gai and/or Gal2/13 
proteins in male and female B cells, we created a transgenic mouse 
line in which bacterial artificial chromosomes (BACs) were used to 
express a functionally validated construct consisting of GPR174 fused 
to green fluorescent protein (GFP) (Extended Data Fig. 8). We then 
immunoprecipitated GPR174-GFP-associated proteins from primary 
Bcells using the GFP as a tag. B cells did not markedly express Gai-3 or 
Gai12 (data not shown), and our analyses thus focused on Gai-1, Gai-2 
and Gal3. As shown in Fig. 3a, GPR174 in activated male or female 
B cells was associated with Gal3 but minimally with Gai-1 or Gai-2, 
consistent with Gal3 activation by GPR174 in an assay of transforming 
growth factor-a (TGFa) shedding”. We also found that, upon CCL21 
stimulation, GPR174 association with Gal3 became higher and asso- 
ciation with Gai-1 and Gai-2 was markedly increased, consistent with 
GPR174-promoted migration to CCL21; notably, the GPR174-Gai 
association was stronger in male than in female B cells, whereas the 
GPR174-Gal3 association showed no difference between the sexes. 
For B cells isolated from female GPR174—GFP mice that were treated 
with testosterone, the profile of Ga association became similar to 
that of male B cells, exhibiting overtly increased CCL21-dependent 
Gai association and unchanged Gal3 association (Fig. 3b). These data 
show that ligand-engaged GPR174 is differentially coupled to Gai in 
male and female B cells, and reveal a likely mechanism for the sexual 
dimorphism of GPR174-mediated migration to CCL21. 

Inanactive immune response, GPR174-deficient male B cells became 
more concentrated within the follicular dendritic cell network and 
made a much larger contribution to germinal-centre formation 
(Extended Data Fig. 9), whereas GPR174 made no difference to B-cell 
localization or germinal-centre magnitude in females (Extended Data 
Fig. 10 and data not shown). This indicates that the sexually dimorphic 
GPR174 response to CCL21is one reason why humoral responses are 
weaker in males than in females. To test whether this GPR174 effect 
impinges on susceptibility to B-cell-dependent autoimmune diseases, 
we resorted to experimental autoimmune encephalomyelitis (EAE), 
which is induced in mice by immunization with human myelin oligo- 
dendrocyte glycoprotein (MOG,_,..)—a model that recapitulates the 
requisite role of B cells in human multiple sclerosis”. We immunized 
male pMT:Gpr174‘, male uMT:Gpr174 ", female uMT:Gpr174*"* and 
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female uMT:Gpr174" bone-marrow (80/20) chimaeras with human 
MOG,_»9 protein and followed the disease course. All mice developed 
symptoms of EAE, with females suffering a more severe disease course; 
GPRI174 deletion did not change the disease course in females but did 
exacerbate it in males (Fig. 4a). Correlating with this finding, male 
UMT:Gpr174"” chimaeras produced the lowest MOG, ,»9-specific anti- 
body titres, which were increased in male pMT:Gpr174”" chimaeras 
toa level similar to that in females (Fig. 4b). Therefore, the sexually 
dimorphic functions of GPR174 in B cells contribute substantially to 
different disease susceptibilities in the two sexes. 

Our study identifies GPR174 as a receptor for CCL21, and reveals its 
chemotactic effects on activated B cells as a mechanism for sexual 
dimorphism in humoral responses and autoimmunity. Testosterone- 
conditioned differential coupling to Gai proteins underlies the different 
effects of GPR174 on male and female B-cell migration to CCL21. Given 
the well-established role of CCR7 in CCL21-mediated chemotaxis, it is an 
intriguing possibility that CCL21 binding might induce GPR174—CCR7 
heterodimerization and achieve a signalling outcome that these two 
receptors cannot attain individually. When CCR7 is ablated in B cells, 
CCL21-induced GPR174-Gai coupling and GPR174-mediated migration 
are markedly reduced (R.Z. et al., unpublished observations), suggest- 
ing a dual role for CCR7 in governing the B-cell migratory response 
to CCL21 via itself and GPR174. LysoPS restrains regulatory T cellsina 
GPR174-dependent manner”. Future studies will need to elucidate how 
tripartite CCL21-GPR174—-CCR7 interactions regulate Gai coupling and 
downstream signals in atestosterone-dependent manner, and whether 
and how LysoPS and CCL21 might compete or collaborate in trigger- 
ing GPR174-dependent functions. Although the functions of GPR174 
in humans remain to be validated, antagonists that selectively target 
GPR174 binding to CCL21 or selectively inhibit GPR174-Gai coupling 
in B cells might boost vaccine outcomes in otherwise poorly respond- 
ing men. 
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Methods 


Mice 

The following mice were originally from the Jackson Laboratory: C57BL/6 
(Jax664), UMT (Jax2288), GFP-expressing (Jax4353), cyan fluorescent 
protein (CFP)-expressing (Jax 4218), dsRed-expressing (Jax 6051), 
OVA3)3-339-Specific T-cell-receptor transgenic OT-II (Jax 4194) and HEL- 
specific lg-transgenic MD4 (Jax2595). GPR174-deficient mice were gener- 
ated by standard gene-targeting procedures to replace the Gpr174 open 
reading frame with aLacZ/neo cassette using the 129SvEvBrd embryonic 
stem cell line (Texas A&M Institute for Genomic Medicine, TGO128). These 
GPR174-deficient mice were backcrossed to C57BL/6 mice for 12 genera- 
tions. Relevant mice on the C57BL/6 background were interbred to obtain 
GFP-expressing MD4 mice and dsRed-expressing GPR174-sufficient or 
-deficient MD4 mice. Age-matched littermates between 6 and 12 weeks 
of age and of the indicated sexes were used for experiments. Sample 
sizes for mouse experiments were empirically determined, and mice 
were randomly assigned to control or experimental group. No blinding 
was necessary for the mouse experiments presented here. All mice were 
maintained under specific-pathogen-free conditions and were used in 
accordance with governmental and Tsinghua Institutional Animal Care 
and Use Committee guidelines for animal welfare. 


GPR174-GFP BAC transgenic mice 

Sequences coding for the glycine-glycine-glycine-serine (GGGS) 
linker (repeated four times) and enhanced GFP (EGFP) were inserted 
immediately before the stop codon of the GPR174 open reading frame 
in the mouse BAC clone RP23-206J14 by homologous recombination. 
The modified BAC was purified and microinjected into the pronuclei of 
fertilized ova to generate transgenic reporters. A founder mouse that 
was screened positive for GFP-expressing B cells inthe blood was used 
to further breed with B6 mice to establish the GPR174—GFP BAC strain. 


Cell culture, retrovirus and in vitro transduction 

Naive T cells or B cells were isolated using the Negative CD4 T-cell Isola- 
tion Kit or the Naive B-cell Isolation Kit (Miltenyi Biotec) according to 
the manufacturer’s protocols. To overexpress target genes in B cells, 
purified B cells were activated with 1 pg ml“ lipopolysaccharide (Sigma) 
for 1 day before being spin-infected with retroviral supernatants at 
1,500g for 2h, as described"®. 


Construction of bone-marrow chimaeras 

B6 recipient mice were lethally irradiated by X-ray (5.5 Gy, twice) and 
then given an intravenous transfer of 3 x 10° sex-matched bone-marrow 
cells, consisting of 80% pMT cells and 20% GPR174-sufficient or -defi- 
cient cells. Chimaeras were used for experiments six to eight weeks 
after reconstitution. 


Gonadectomy 

Mice after weaning were anaesthetized with avertin (2.5%, 0.015 ml g* 
body weight) intraperitoneally. For ovariectomy of female mice, ova- 
ries on both sides were exposed and removed through bilateral dorsal 
incisions. For orchidectomy in male mice, a median abdominal incision 
was made to expose and remove testicles on both sides. Sham-operated 
control mice underwent surgery including bilateral dorsal incisions or 
a median abdominal incision without removal of ovaries or testicles. 
Surgical wound openings were sutured, and antibiotics and analgesics 
were applied locally. Mice were allowed to recover for eight weeks 
before subsequent experiments. 


Testosterone treatment 

Six- to eight-week-old mice were subcutaneously injected with tes- 
tosterone (10 mg kg“ body weight) dissolved in sunflower seed oil 
every other day for two weeks. Vehicle control mice were treated with 
sunflower seed oil alone. 


Adoptive transfer, immunization and viral infection 

To measure germinal-centre formation by MD4 B cells, 10° OT-IIT cells 
and5x10°MD4 B cells of indicated genotypes were intravenously trans- 
ferred into male B6 recipients, which were subsequently immunized 
subcutaneously with 0.5 pg lipopolysaccharide and 30 pg HEL-OVA 
conjugate antigen in alum (Thermo Scientific). The HEL-OVA was made 
by chemical crosslinking with a HydraLink conjugation kit (SoluLink) 
as previously described’. To measure germinal-centre formation in 
response to SRBC immunization or lymphocytic choriomeningitis 
virus (LCMV) infection, mice were injected intraperitoneally with 5 x 10° 
SRBCs (from Z. Baiji) or 2 10° plaque-forming units of the LCMV Arm- 
strong virus (from L. Ye and R. Ahmed). 


EAE by MOG protein immunization 

The sequence encoding the first 120 amino acids of human MOG was 
amplified by polymerase chain reaction (PCR) from a human brain 
complementary DNA library, and cloned between the Ndel and Xhol 
sites of the pET22b+ expression vector (Novagen). Human MOG pro- 
tein was expressed in BL21(DE3) Escherichia coli and purified by its 
carboxy-terminal histidine (His) tag. Protein purity and concentra- 
tion were assessed by SDS-PAGE and bicinchoninic acid (BCA) protein 
assay, respectively. Recombinant proteins were stored at -80 °C until 
use. To induce EAE, bone-marrow-chimeric animals of indicated types 
were immunized subcutaneously at the flank with 200 pg recombinant 
human MOG protein emulsified in complete Freund’s adjuvant on day 
O. Mice were injected intravenously with 200 ng of pertussis toxin 
(Invitrogen) on days 0 and 2. Mice were monitored daily ina blind man- 
ner for disease progression from day 1. Disease severity was scored as 
following: 0, no clinical signs; 1, paralysed tail; 2, loss of coordinated 
movement and paresis of hind limbs; 2.5, paralysis of one hind limb; 
3, paralysis of both hind limbs; 3.5, paralysis of both hind limbs and 
weakness in forelimbs; 4, paralysis of forelimbs; and 5, moribund. To 
compare disease severity, we studied ten mice per condition per experi- 
ment, and analysed disease scores over time with two-way ANOVA. 


Flow cytometry 

Single-cell suspensions were incubated in MACS buffer (phosphate- 
buffered saline (PBS) supplemented with 1% fetal bovine serum (FBS) 
and 5 mM EDTA) containing 20 pg ml‘ 2.4G2 (Bioxcell) for 20 min 
before being stained with indicated monoclonal antibodies. Stain- 
ing reagents included: BV421-anti-CD4 (GK1.5), APC-Cy7-anti-CD19 
(1D3), PE-Cy7-anti-CD95 (Jo2), AlexaFluor-700-anti-B220 (RA3-6B2), 
eFlour450-anti-B220 (RA3-6B2) and APC-anti-CD138 (281-2) from BD 
Biosciences; and eFlour450-anti-GL7 (GL-7) and FITC-anti-GL7 (GL-7) 
from eBioscience; PerCP Cy5S.5 anti-IgD (HK1.4), biotinylated anti-CCR7 
(4B12), and streptavidin-APC (405207) from Biolegend. Dead cells were 
excluded from analysis by staining with 7-AAD (Biotium) or using the 
Zombie Yellow Fixable Viability kit (Biolegend). All cytometry data 
were collected on an LSRII or FACSAria Ill cytometer (BD Biosciences) 
and analysed with FlowJo software (TreeStar). 


Measurement of antigen-specific antibody titres 

To measure SRBC-specific antibody titres, we lysed 5 x 10°SRBCs with 
1ml double-distilled water and centrifuged them at 15,000 r.p.m. for 
20 min. The pellet was resuspended in PBS and used to coat MaxiSorp 
enzyme-linked immunosorbent assay (ELISA) plates (Nunc Maxisorp) 
overnight at 4 °C. To measure MOG-specific serum antibodies, we 
used 2 pg ml purified recombinant human MOG to coat the ELISA 
plates. Nonspecific binding was blocked with 1% bovine serum albu- 
min (BSA) in PBS with Tween-20 (PBST) for 2 h at 37 °C, followed by 
incubation with 2x serial dilutions of serum samples of immunized 
mice for 1h at 37 °C. Plates were washed and incubated with horse- 
radish peroxidase (HRP)-conjugated anti-mouse IgG secondary anti- 
body (1/20,000) for 1h at 37 °C. Plates were washed, developed with 


Article 


3,3’,5,5’-tetramethylbenzidine (TMB) and stopped with 1M HCI. Weset 
blank wells as zero, and recorded absorbance at 450 nm. We used sera 
from unimmunized mice as the negative control; the cut-off value was 
the optical density at 450 nm (OD,,;,) of the negative control multiplied 
by 2.1. A well with an OD,,, value of no less than the cut-off value was 
considered positive. The highest dilution of asample that gave positiv- 
ity is the titre of the sample. 


Distribution of B cells by immunohistochemistry 

For various experiments, naive B cells, B cells activated for 1h with 
10 pg mI F(ab’), goat anti-mouse IgM (Jackson Immunoresearch), or 
Bcells transduced with retrovirus were transferred into B6 recipients. 
When required, these cells were labelled with 1 uM tetramethylrho- 
damine (TAMRA), 4-chloromethyI-6,8-difluoro-7-hydroxycoumarin 
(CMF2HC) or 5-chloromethylfluorescein diacetate (CMFDA) (Invit- 
rogen). Spleens or inguinal lymph nodes of the recipient mice were 
fixed with 1% paraformaldehyde for 12 h and then dehydrated in 30% 
sucrose solution for 12 h at 4 °C. To ensure maximum representation of 
different organ regions in the final dataset, we processed nonconsecu- 
tive tissue sections and stained them with indicated antibodies. Stain- 
ing reagents included eFluor450-IgD (eBioscience), APC-CD35 (BD), 
PE-CD3 (BD), eFluor450-B220 (eBioscience), rabbit anti-GFP (abcam), 
and AF488 goat anti-rabbit IgG (Invitrogen). Slides were mounted with 
the ProlongGold Antifade reagent (Invitrogen) and examined with 
an Olympus FV1000 upright microscope. Images were analysed with 
Imaris (Bitplane) and Image) 1.46r (National Institutes of Health). 


Splenic stroma preparation 

Spleens from mice, rats or pigs were repeatedly pressed and ground 
against a metal mesh to release red blood cells and lymphocytes. The 
remaining unsuspendable, amorphic stromal tissue elements were 
thoroughly washed with PBS before being placed in B27 serum-free 
culture medium (Invitrogen) and incubated at 37 °C for 12h. For mice, 
stromal elements from three animals were cultured in 1 ml medium. 
For pigs, stromal elements from 1 spleen were cultured in 80 ml. The 
resultant culture was centrifuged at 10,000 r.p.m. for 60 min to obtain 
the conditioned medium, which was either immediately used for experi- 
ments or frozen at —20 °C until use. 


Transwell assay 

B cells activated with 1 pg ml“ lipopolysaccharide for 60 h were used 
to assay different fractions resulting from biochemical fractionation. 
Naive B cells or B cells activated with 10 pg ml F(ab’), goat anti-mouse 
IgM (JacksonImmunoresearch) and 10 pg mI anti-CD40 (clone FGK4.5, 
Bio X Cell) were used to assay CCL21-induced migration. When B cells 
of different genotypes were compared, at least three donor mice of 
each genotype and treatment were used to isolate B cells in each experi- 
ment, and littermates were always used. B cells were suspended at 
10° cells per millilitre and rested in RPMI medium containing 1% FBS 
at 37 °C for 1h. Next a total of 100 pl cell suspension was added to the 
upper chamber ina 5-um-pore Transwell (Corning Costar), and a total 
of 150 pl attractant-containing solutions was added to the bottom 
chamber. These solutions included culture medium conditioned with 
crude mouse splenic stroma preparation, culture medium conditioned 
with crude porcine splenic stroma preparation, elution fractions from 
chromatography, culture medium containing recombinant CCL21 and 
CCL19 (Peprotech), or 18/0 LysoPS (Avanti Polar Lipids) of indicated 
concentrations. For some experiments, murine stroma-conditioned 
medium was first supplemented witha final concentration of 5 pg mI 
neutralizing antibodies against mouse CCL21 or CCL19 (R&D system) 
or goat IgG isotype control, and incubated at 4 °C for 3 h before being 
used for the transwell assay. The antibody dose was at least sufficient 
toneutralize 100 ng mI CCL21 or CCL19, as determined in preliminary 
experiments. Cells were allowed to transmigrate for 3 h at 37 °Cinan 
incubator. Cells that had migrated to the bottom wells were enumerated 


by flow cytometry, with fluorescent A20 cells of known numbers added 
immediately before reading as an internal counting standard. Each 
condition was measured in triplicate wells unless indicated otherwise. 


Biochemical fractionation 

Chromatography was carried out using an AKTApurifer 10 fast protein 
liquid chromatography (FPLC) system (GE Healthcare) at 4 °C, withthe 
exception of the last step, which was carried out using an AKTAmicro 
FPLC system (GE Healthcare). A total of 1,200 ml porcine stroma-con- 
ditioned medium was applied to an anion-exchange column (250 ml 
bed volume) equilibrated with buffer I (25 mM Hepes, pH 7.0). The 
column was washed with three column volumes before being eluted 
with three column volumes of buffer 1 supplemented with 1M NaCl. The 
flow-through was buffer-changed with buffer II (25 mM Tris-HCl, pH 8.5) 
into 100 ml and reapplied to an anion-exchange column equilibrated 
with buffer II. The column was washed with two column volumes of 
buffer Il and eluted with two column volumes of a0.1-0.5 M linear NaCl 
gradient. Fractions of 10 ml each were collected, and aliquots were 
buffer-changed with RPMI 1640 for the transwell assay. Active fractions 
were pooled and buffer-changed into 10 ml with buffer II and loaded 
onto aheparin column (5 ml bed volume) equilibrated with buffer II. The 
column was washed with 12 column volumes of buffer II and eluted with 
4column volumes of 0-2 M linear NaCl gradient. Fractions of 2ml each 
were collected and aliquots were buffer-changed with RPMI 1640 for 
the transwell assay. Active fractions were again pooled, buffer-changed 
into 0.5 ml buffer I and loaded onto a Superdex-75 column (24 ml bed 
volume) equilibrated with buffer I. The column was then eluted with 
one-column volume of buffer I. Fractions of 0.5 ml each were collected, 
and aliquots were buffer-changed with RPMI 1640 for the transwell 
assay. Active fractions were pooled and concentrated into 0.5 ml to load 
ontoaMonoScolumn (1 ml bed volume) equilibrated with buffer I. The 
column was washed with 12 column volumes of buffer I and eluted with 
25 column volumes of 0-0.5 M linear NaCl gradient. Fractions of 1 ml 
each were collected, and aliquots were buffer-changed with RPMI 1640 
for the transwell assay. Active fractions were pooled, buffer-changed 
and concentrated into 50 ul with buffer I, and reloaded onto the Mono 
Scolumn equilibrated with buffer I. The column was washed with three 
column volumes of buffer I containing 0.3 M NaCl, and eluted with 24 
column volumes of 0.3-0.4 M linear NaCl gradient. Fractions of 1 ml 
each were collected, and aliquots were buffer-changed with RPMI 1640 
for the final transwell assay to identify the fraction that contained the 
strongest chemoattractant activity. 


Mass spectrometry analysis 

Fractions from the last purification step were concentrated into 50 ul 
and analysed on 12% NuPAGE gels (Invitrogen). Gels were stained with 
the Pierce silver stain for mass spectrometry (Thermo Fisher Scientific) 
according to the manufacturer’s protocol. Bands of increased intensi- 
ties in the fraction containing the strongest chemoattractant activity 
were excised and subjected to in-gel digestion before being analysed 
with a Q Exactive HF mass spectrometer (Thermo Fisher Scientific). 
Mass spectrometry data were analysed using the Swissprot database 
with the MASCOT search engine. 


His-tagged recombinant CCL21 and binding assay 

The mouse CCL21 coding sequence was amplified from C57BL/6 mouse 
spleen cDNA and cloned between the Ndel and Xhol sites of the pET22b+ 
expression vector (Novagen), which provides a C-terminal His tag. The 
recombinant protein was expressed in BL21(DE3) mice and purified by 
its His tag. Protein purity and concentration were assessed by SDS- 
PAGE and BCA assay, respectively, and bioactivity was verified using 
acalcium mobilization assay. Recombinant proteins were stored at 
-80 °C until use. To measure CCL21-His binding to GPR174, with CCR7 
as a positive control, 293T cells were transfected with GPR174—GFP or 
CCR7-GFP fusion constructs. Aliquots of transfected cells were then 


incubated with CCL21-His protein at indicated concentrations at 37 °C 
for 30 min, washed twice with cold PBS, and fixed immediately with 4% 
paraformaldehyde. After washings in PBS, the fixed cells were stained 
with a phycoerythrin-conjugated anti-His antibody (clone JO95G46, 
Biolegend) and analysed by flow cytometry. GFP cells in each sample 
served as anonspecific background staining internal control, andthe 
mean fluorescence intensity (MFI) of GFP* cells stained with phyco- 
erythrin-conjugated anti-His antibody, with the MFI of corresponding 
GFP’ cells subtracted, was used to quantify specific CCL21 binding. 


Measurement of chemokine-triggered calcium fluxes 

Mouse Gpr174 and Ccr7 were amplified by PCR and cloned into the 
pRKS plasmid. HEK293T cells were transiently transfected with GPR174- 
expressing, CCR7-expressing or control plasmid. Cells were physically 
detached from the culture vessel 48 h after transfection, washed twice 
with PBS, and stained with 1 pM Indo-1 (Invitrogen) in PBS at 37 °C for 
30 min. After being washed twice with PBS, cells were resuspended with 
Hanks’ balanced salt solution (Invitrogen) containing 25 mM Hepes and 
1% FBS, and kept on ice. When subjected to chemokine stimulation, cells 
were first brought into a water batch at 37 °C for 5 min of incubation, 
and then assessed on an LSR II cytometer (BD Biosciences) to establish 
the baseline for the unstimulated state. Cells were stimulated witha 
concentration series of recombinant CCL21 ranging from 0.1ng mI"to 
10 pg mI”. Cells at each condition were continuously monitored and 
recorded for 2 min. At the end, ionomycin (Sigma) was further added 
to the sample at a final concentration of 1 pg mI”, and the sample was 
further monitored and recorded for 1 min. Indo-1(bound)/Indo-1(free) 
ratios were used to indicate intracellular Ca”* concentrations, as ana- 
lysed in FlowJo (TreeStar). The highest Ca”* concentration stimulated 
by ionomycin was used as 100% to normalize the calcium response 
induced by CCL21. The EC;, was estimated in GraphPad by fitting a 
three-parameter, nonlinear dose-response curve. 


Immunoprecipitation and western blotting 

Freshly isolated mouse B cells from GPR174—GFP BAC transgenic mice 
were activated with 10 pg mI! F(ab’), goat anti-mouse lgM and10 pg mI" 
anti-CD40 for 48 h. These activated B cells were then suspended at 
2x10’ cells per millilitre in REMI medium containing 1% FBS, and left 
untreated or stimulated with 300 ng ml CCL21 at 37 °C for 30 min. B 
cells were then immediately lysed in 1% NP-40 lysis buffer (25 mM Tris- 
HCI, pH 7.4, 137 mM NaCl, 1% Nonidet P-40, 20% glycerol and protease 
inhibitors). For immunoprecipitation of GFP-tagged GPR174, lysates 
of 10°B cells were incubated overnight with 20 pl of GFP-Trap beads 
(ChromoTek). After repeated washings, immunoprecipitates were 
eluted by incubation in 0.2 M Glycine buffer (pH 3.0) for 10 min at room 
temperature, and then neutralized with 1M Tris-HCl (pH 8.5). Proteins 
were separated by SDS-PAGE and transferred to polyvinylidene mem- 
brane (Millipore). Membranes were blocked with Tris-buffered saline 
containing 5% BSA and 0.1% Tween 20. To detect target molecules by 
immunoblotting, we used rabbit anti-GFP (Abcam), rabbit anti-Gai-1 
(Abcam), rabbit anti-Gai-2 (Abcam) and rabbit anti-Gal3 (Abcam) anti- 
bodies. HRP-conjugated goat anti-rabbit antibody was purchased from 
Bioeasytech. Immunoblots were detected using enhanced chemilumi- 
nescence (Thermo Fisher Scientific), and images were analysed with 
Image) software. 


Quantitative PCR 
Cells of desired types were sorted with a FACSAria Ill and subjected to 
total RNA extraction with the RNeasy Plus Mini or Micro kit (Qiagen) 


according to the manufacturer’s instructions. RNA was reverse- 
transcribed with All-In-One RT MasterMix (Abm). Quantitative PCR was 
performed with qPCR MasterMix (Abm) ona 7500 Real-Time PCR sys- 
tem (Applied Biosystems). Primers used were as follows: Gapdh sense 
5’-TGTTCCTACCCCCAATGTGTC, antisense 5’-TAGCCCAAGATGCC 
CTTCAGT; Gpr1/4 sense 5’-AGGCCACACACCTTTTTCCC, antisense 
5’-CAGGCCAGGACATCATGGAA,; SIpr2 sense 5’-CAACTCCGGGACATA 
GACCG, antisense 5’-CCAGCGTCTCCTTGGTGTAA; Gpr183 sense 
5’- CATAAAAGGACGCCTGCTGG, antisense 5’- TTTCCCACCAGCCC 
AATGAT; Ccr7 sense 5’-GGTGGCTCTCCTTGTCATTTTC, antisense 
5’- TACGTCAGTATCACCAGCCC; Cxcr5 sense 5’- ACTACCCACTAAC 
CCTGGACA, antisense 5’- CGAGGTGGAACAGGAAGGTC. Relative 
expression of target genes among different samples was compared 
after normalization against expression of the housekeeping genes 
Actb or Gapdh. 


Statistical data analysis 

Statistics and graphing were conducted in Prism (Graphpad). Unless 
indicated otherwise, two-tailed unpaired Student’s t-tests were used 
to compare endpoint means of different groups. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Data generated here are included within the paper (and its Supplemen- 
tary Information files) or available from the corresponding author 
upon reasonable request. Source Data for Figs. 1-4 and Extended Data 
Figs. 1-10 are provided with the paper. 
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against the dsRed/GFP ratio in transfer inoculum. Each symbol represents one horizontal lines denoting mean (b, d); two-tailed unpaired Student's t-tests 
mouse (n=17 and 16); data were pooled from three independent experiments were used to calculate the Pvalues at the top of each graph. 
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Extended Data Fig. 3| GPR174 suppresses germinal-centre formationin 
male, but not female, mice. a, Representative cytometric profiles of FAS"GL7™ 
frequencies of transduced MD4 B cells in B6 recipients five days after 
immunization. b,c, Representative cytometric profiles of germinal centres (b) 
and SPPCs (c) in mice of the indicated genotypes and sexes seven days after 
SRBC immunization, matching the summary statistics presented in Fig. 1d, e, 
respectively. d, Representative cytometric profiles (left panels) and summary 
statistics (right panel) of germinal centres in littermate mice of the indicated 


genotypes and sexes, eight days after infection with LCMV Armstrong (10° 
plaque-forming units per mouse). The scatter plot (right) shows individual 
mice with lines denoting mean (n=15, 15, 14 and 14 mice, from left to right). Data 
pooled from three independent experiments with similar results. A two-tailed 
unpaired Student’s ¢-test was used. e, f, Representative cytometric profiles of 
germinal centres (e) and SPPCs (f) in mixed bone-marrow (BM) chimeric mice 
of indicated types seven days after SRBC immunization, matching the 
summary statistics presented in Fig. 1g, h, respectively. 


Extended Data Fig. 4 | Endogenous GPR174 does not affect the follicular 
distribution of naive B cells or the accumulation of freshly activated B cells 
tothe T-cell-B-cell border. a, Splenic distribution of GPR174-sufficient and 
-deficient naive B cells 24 h after adoptive transfer. WT, wild type; KO, 
knockout. Data are representative of two independent experiments involving 
either male or female cells. Scale bars, 50 pm. b, Left, splenic distribution of 
GPR174-sufficient or -deficient B cells that were stimulated with anti-IgM, 6h 
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the T-cell-B-cell border and the T-cell zone, based onIgD and CD3 staining. 
Right, relative abundance of transferred B cells in the indicated regions. Data 
(n=24 biological replicates) represent two independent experiments with 
similar results. Lines in scatter plots denote mean. We used two-way ANOVA 
with Bonferroni’s multiple comparisons; ns, not significant. 
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Extended Data Fig. 5| See next page for caption. 


Extended Data Fig. 5 | Minimal LysoPS effects on GPR174-dependent B-cell 
migration and GPR174 ligand identification. a, Transwell migration of 
lipopolysaccharide-activated GPR174-sufficient or -deficient B cellsin 
response to LysoPS of indicated concentrations. Data are plotted as 

mean +s.e.m. of the percentage of cells that migrated in triplicated wells at 
each concentration, fitted with three-parameter log dose-response curves; 
two-way ANOVA was used to compare the two groups. One of two independent 
experiments with similar results is shown. b-d, Transwell migration of 
lipopolysaccharide-activated B cells that were transduced witha control or 
GPR174-expressing vector, in response to stroma-conditioned medium 
(stroma), dimethylsulfoxide (DMSO) control or LysoPS of indicated 
concentrations (b); or in response to stroma-conditioned medium (stroma), 
conditioned medium heated at 100 °C for 10 min (heated), conditioned 
medium treated with the proteinase K inhibitor PMSF alone (PMSF), 
conditioned medium treated with proteinase K and then PMSF (PK + PMSF), or 
conditioned media treated as above and then further supplemented with 

300 ng mI CXCL13 (PK + PMSF + CXCL13) to exclude cell damage owing to 
remaining proteinase activity (c); or inresponse to blank culture medium, or 


mouse or porcine stroma-conditioned media (d). All data are plotted as the 
mean percentages of cells that migrated in triplicated wells, from one 
experiment representative of three with similar results. Two-way ANOVA with 
Bonferroni’s multiple comparison tests were used to compare vector and 
GPR174 groups, with Pvalues given in the graphs. ****P< 0.0001; ns, not 
significant. e, Workflow showing the six-step biochemical fractionation 
method for identifying GPR174 ligands in splenic stroma-conditioned medium. 
See Methods for details. CV, column volume. f-k, Chromatography traces and 
putative ligand activities as detected by the transwell assay ind in relevant 
fractions during the six steps of e. AU, arbitrary units; mS, millisiemens. I, Left, 
silver staining of the indicated fractions from k resolved on 12% SDS-PAGE; the 
black box on fraction 5 marks bands subjected to liquid chromatography-mass 
spectrometry (LC-MS)/MS analysis. Right, identified unique porcine CCL21- 
derived peptides (solid underlines) aligned against porcine, human and murine 
CCL21 protein sequences. For gel source data, see Supplementary Fig. 1. Data 
shown in f-lare from one of two independent experiments with similar results. 
FT, flow-through. 
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Extended Data Fig. 6 |CCL21 and CCL19 are chemoattractant ligands of 
GPR174. a, b, Transwell migration of lipopolysaccharide-activated B cells that 
were transduced with a control or GPR174-expressing vector: a, in response to 
stroma-conditioned medium (stroma) or 100 ng mI‘ recombinant CCL21 or 
CCL19, in the presence of controllgG or 5 pg mI CCL21- or CCL19-blocking 
antibody, individually or in combination; b, in response to recombinant mouse 
CCL21 (left) or CCL19 (right). Data represent triplicated wells for each 
condition from one of three independent experiments with similar results. 
Two-tailed unpaired Student’s t-tests; ****P< 0.0001.c, Cytometric profiles and 
MFI of CCL21 binding to HEK293T cells transfected with control GFP, GPR174- 
GFP or CCR7-GFP fusion constructs, with background staining on the GFP" 
fraction subtracted from the corresponding GFP’ fraction. Left, gating of GFP” 
and GFP’ cells in each group (top) and histograms of anti-His staining (bottom) 
of cells that were incubated with different doses of His-tagged recombinant 


RS) AS) 
CCL21 (M) 


CCL21. PE, phycoerythrin. Right, data represent three biological replicates for 
each condition, from one of three independent experiments with similar 
results. Two-way ANOVA tests were used to compare groups, with Pvalues 
given inthe graph; red asterisks, GFP versus GPR174—GFP; blue asterisks, GFP 
versus CCR7-GFP; green asterisks, GPR174—GFP versus CCR7-GFP; 

****P < 0.0001. d,e, Exemplary calcium responses of HEK293T cells transfected 
with vector, GPR174 or CCR7, following stimulation with CCL21 or the calcium 
ionophore ionomycin at the indicated concentrations (d); and the magnitude 
of CCL21-triggered calcium responses asa fraction of the ionomycin-triggered 
maximum of the same cells, overlaid with three-parameter log dose-response 
curves ontop (e). The two different symbols indicate data from two 
independent experiments. Pvalues obtained from an extra sum-of-squares 
F-test of the null hypothesis that the GPR174 and CCR7 curves have the same 
EC5o. 
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Extended Data Fig. 7| See next page for caption. 
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Extended Data Fig. 7 | Testosterone dependence of the GPR174-mediated 
migratory response to CCL21 does not involve modulation of GPR174 or 
CCR7 expression. a, CCL21-induced transwell migration of B cells from 


GPR174-sufficient (WT) or -deficient (KO) male littermate mice that were sham- 


operated (sham) or orchidectomized (ORX), allowed to recover for six weeks, 
and thentreated with either vehicle or testosterone for two weeks. Data 
represent three biological replicates from one of two independent 
experiments with similar results. Blue asterisks, male sham wild type plus 
vehicle versus sham GPR174-knockout plus vehicle; red asterisks, male ORX 
wild type plus vehicle versus male ORX wild type plus testosterone; green 
asterisks, male sham wild type plus vehicle versus male ORX wild type plus 
testosterone. b, Relative Gpr174 mRNA levels normalized to Gapdh expression 
in naive and activated B cells from male or female mice that were either sham- 
operated or gonadectomized. Data (n = 3 biological replicates) represent three 
independent experiments with similar results. c, Surface CCR7 levels on naive 
or activated B cells isolated from mice of the indicated GPR174 genotypes, 
sexes and gonadectomies. Data shown are histogram overlays (coloured lines, 


CCR7; grey lines, isotype staining control) and CCR7 MFIs after isotype staining 
background subtraction. Each symbol in the MFI data represents one mouse, 
and data represent three independent experiments with similar results. d,e, 
Relative Gpr174 mRNA levels (d) and surface CCR7 levels (e) on activated B cells 
from male mice that were sham-operated or orchidectomized and then treated 
with either vehicle or testosterone. Data represent three mice per group from 
one of three independent experiments with similar results. f, g, Relative Gpr174 
mRNA levels (f) and surface CCR7 levels (g) on activated B cells from male or 
female mice that were treated with either vehicle or testosterone. Data 
represent three mice per group from one of three independent experiments 
with similar results. h, Relative expression of Gpr174 mRNA in male or female 
dsRed-expressing MD4 B cells normalized against that in cotransferred male 
GFP-expressing MD4 B cells 90 h after activation by HEL-OVA immunization (as 
in Extended Data Fig. 1); three line-connected pairs indicate three independent 
experiments. Two-way ANOVA with Bonferroni’s multiple comparisons (a) or 
two-tailed unpaired (b-g) or paired (h) Student’s t-tests were used for 
statistical comparisons, with Pvalues given inthe graph. 
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Extended Data Fig. 8 | Characterization of GPR174-GFP transgenic mice. 
a, Diagrams of the GPR174-GFP fusion (top) and GPR174-GFP BAC construct 
(bottom) inthe context of the Gpr174 genomic locus, with the five exons 
numbered initalics. b, Transwell migration of lipopolysaccharide-activated B 
cells that were transduced witha control, GPR174-expressing or GPR174-GFP- 
expressing vector in response to stroma-conditioned medium (stroma) or 
10ng ml‘ or50ng ml' recombinant CCL21. Data represent three biological 
replicates from one of two independent experiments with similar results. 


naiveB GCB 


c,d, Histograms (c) and geometric MFI (geo.mean) (d) of GFP fluorescence 
from naive and germinal-centre B cells in B6 (grey histograms) or GPR174—GFP 
BAC (open histograms) transgenic mice. Each symbol inthe MFI plot 
represents one mouse, and data represent two independent experiments with 
similar results. Two-way ANOVA with Bonferroni's multiple comparisons (b) or 
two-tailed unpaired Student’s f-test (d) was used for comparisons between 
groups. ****P< 0.0001; ns, not significant. 
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Extended Data Fig. 9 |GPR174 retards follicular-centre localization and and frequencies of dsRed-expressing and GFP-expressing cells of the indicated 
germinal-centre formation by male B cells. a, Representative distribution genotypes inthe transferred MD4 (left panels) or inthe germinal-centre 
patterns of dsRed-expressing and GFP-expressing MD4 cells of the indicated compartment five days after immunization (middle and right panels). 
genotypes in follicles of the draining lymph nodes 90 h after immunization of d, Summary statistics of the dsRed/GFP ratio in MD4 germinal centres 
male recipient mice. Scale bar, 50 pm. b, Summary statistics of ratios between normalized against the dsRed/GFP ratio in transferred MD4. Each symbol 
CCls, as defined in Extended Data Fig. 1d, of dsRed* cells and GFP* cells of the denotes one mouse (n=19 and 19), with lines denoting mean. Data are pooled 
indicated genotypes in the same follicles. Each symbol denotes one follicle from three independent experiments with similar results. Two-tailed unpaired 
g yp y p p p 
(n=42 and 43), with lines denoting mean. Data are pooled from two Student’s ¢-tests were used to compare groups, with Pvalues given inthe graph. 


independent experiments with similar results. c, Representative contour plots 
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Extended Data Fig. 10 | GPR174 does not affect germinal-centre formation 
by female B cells. a, Representative contour plots and frequencies of dsRed- 
expressing and GFP-expressing cells of the indicated genotypes inthe 
transferred MD4 (left panels) or inthe germinal-centre compartment five days 
after immunization (middle and right panels) of male recipient mice. 


a dsRed 


b, Summary statistics of the dsRed/GFP ratio in MD4 germinal centres, 
normalized against the dsRed/GFP ratio in transferred MD4 cells. Each symbol 
denotes one mouse, with lines showing mean. Data (n= 8 and 8) are pooled 
from two independent experiments with similar results. Two-tailed unpaired 
Student's t-test was used to compare groups. 
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Data collection Flowcytometry data were collected using BD FACSDIVA V8.0.1. 
Imaging data were collected using FV10-ASW V3.1. 
Protein purification data were collected using UNICORN V7.0. 


Data analysis Flowcytometry data were processed and analyzed using FlowJo V10. 
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Protein purification data were processed and analyzed using UNICORN V7.0 
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Sample size The sample size for each experiment is indicated in the figure legend. The sample size was chosen empirically to provide a sufficient level of 
statistical power for detecting indicated biological effects. No statistical methods were used to pre-determine the sample size. 
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Data exclusions No data were excluded from analyses. 
Replication All experimental findings were reproducible across multiple independently repeated experiments, as indicated. 
Randomization Where applicable, littermate animals of desired genotypes were randomly assigned into experimental and control groups. 


Blinding Imaging analyses were conducted in a blinded manner. 
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system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
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Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Antibodies included BV421-anti-CD4 (GK1.5, 562891), APC-Cy7-anti-CD19 (1D3, 557655), PE-Cy7-anti-CD95 (Jo2, 557653), 
AlexaFluor-700-anti-B220 (RA3-6B2, 557957), eFlour450-anti-B220 (RA3-6B2, 558108), APC-anti-CD138 (281-2, 558626) from BD 
Biosciences; eFlour450-anti-GL7 (GL-7, 48-5902-82) and FITC-anti-GL7 (GL-7, 53-5902-82) from eBioscience; PerCP Cy5.5 anti-lgD 
(HK1.4, 405709), biotinylated anti-CCR7 (4B12, 120103), and streptavidin-APC (405207) from Biolegend. Antibodies were used at 
1:200 dilution for staining cells or tissue sections. 


Validation Based on information provided by commercial vendors. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) A20 and 293T cell lines were originally from the American Type Culture Collection (Manassas, VA, USA). 
Authentication No specific procedure was taken to authenticate the cell line identity. 
Mycoplasma contamination The cell lines have been tested negative for mycoplasma contamination. 


Commonly misidentified lines No cell lines used are in the database of commonly misidentified cell lines. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals C57BL/6 (Jax664), WMT (Jax2288), GFP-expressing (Jax4353), CFP-expressing (Jax 4218), dsRed-expressing (Jax 6051), OVA323— 
339-specific T-cell receptor transgenic OT-II (Jax 4194), and HEL-specific Ig-transgenic MD4 (Jax2595) mice were originally from 
the Jackson Laboratory. GPR174-deficient mice were generated by standard gene targeting procedures to replace the Gpr174 
open reading frame with a lacZ/neo cassette using 129SvEvBrd embryonic stem cell line (Texas A&M Institute for Genomic 
Medicine, TGO128). These mutant mice were backcrossed to C57BL/6 for 12 generations. Relevant mice on the C57BL/6 
background were interbred to obtain GFP-expressing MD4 mice and dsRed-expressing GPR174-sufficient or -deficient dsRed 
MD4 mice. Age-matched littermates between 6 and 12 weeks of age and of indicated sexes were used for experiments. All mice 
were maintained under specific-pathogen free conditions and were used in accordance of governmental and Tsinghua guidelines 


> 
jad) 
2 
= 
= 
o 
= 
o 
Nn 
© 
red) 
= 
(a) 
= 
= 
io 
18. 
fo) 
oa 
D 
a 
Za) 
iS 
3) 
S 
fed) 
5 
< 


for animal 
welfare. 
Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field. 


Ethics oversight Identify the organization(s) that approved or provided guidance the study protocol, OR stc 


1 WI 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Methodology 
Sample preparation Spleen and LN single-cell suspension was incubated in MACS buffer (phosphate-buffered saline (PBS) supplemented with 1% 
fetal bovine serum (FBS) and 5 mM EDTA) containing 20 ug/ml 2.4G2 (BioXcell) for 20 min before being stained with indicated 
monoclonal antibodies. 
Instrument LSR Il or FACSAria Ill cytometer (BD Biosciences) 
Software Flowcytometry data were processed and analyzed using FlowJo V10. 


Cell population abundance At least 10000 events were acquired for cells in the defined gate. 


Gating strategy For all experiments FSC-A/ SSC-A gates of the starting cell population were used to identify viable cells. Singlet cells were 
identified using FSC-H/ FSC-W gating. Isotype control was used to distinguish between background and marker-positive events. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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KRAS GTPases are activated in one-third of cancers, and KRAS(G12C) is one of the 
most common activating alterations in lung adenocarcinoma’”. KRAS(G12C) 
inhibitors** are in phase-I clinical trials and early data show partial responses in nearly 
half of patients with lung cancer. How cancer cells bypass inhibition to prevent 
maximal response to therapy is not understood. Because KRAS(G12C) cycles between 
anactive and inactive conformation‘ ®, and the inhibitors bind only to the latter, we 
tested whether isogenic cell populations respond ina non-uniform manner by 
studying the effect of treatment at a single-cell resolution. Here we report that, shortly 
after treatment, some cancer cells are sequestered in a quiescent state with low KRAS 
activity, whereas others bypass this effect to resume proliferation. This rapid 
divergent response occurs because some quiescent cells produce new KRAS(G12C) in 
response to suppressed mitogen-activated protein kinase output. New KRAS(G12C) is 


maintained inits active, drug-insensitive state by epidermal growth factor receptor 
and aurora kinase signalling. Cells without these adaptive changes—or cells in which 
these changes are pharmacologically inhibited—remain sensitive to drug treatment, 
because new KRAS(G12C) is either not available or exists in its inactive, drug-sensitive 
state. The direct targeting of KRAS oncoproteins has been a longstanding objective in 
precision oncology. Our study uncovers a flexible non-uniform fitness mechanism 
that enables groups of cells within a population to rapidly bypass the effect of 
treatment. This adaptive process must be overcome if we are to achieve complete and 
durable responses in the clinic. 


KRAS(G12C) undergoes nucleotide cycling between its active (GTP- 
bound) and inactive (GDP-bound) states in cancer cells*~*”, First-in-class 
mutant GTPase inhibitors target KRAS(G12C) ina conformation-specific 
manner: they bind only to the inactive state and trap the oncoprotein 
by preventing its reactivation by nucleotide exchange °*”. Upon treat- 
ment with aKRAS(G12C)-specific inhibitor (G12Ci), KRAS(G12C)-mutant 
cells had an initial inhibition that was followed by a reaccumulation of 
active KRAS (KRAS-GTP) and reactivation of its downstream signalling 
(Extended Data Fig. 1), a pattern that is consistent with adaptation” ”. 
The KRAS(G12C) nucleotide cycle and the conformation-specific nature 
of inhibition led us to test whether adaptation to treatment with G12Ci 
occurs ina non-uniform manner across cancer cells ina population 
(see Methods for rationale). 

To this end, we performed single-cell RNA sequencing (scRNA-seq)?"* 
onthree models of KRAS(G12C) lung cancer, treated with the G12Ci for 
0,4, 24 and 72h (Methods). After controlling for potentially confound- 
ing variables (Extended Data Fig. 2a—e) and reducing the dimensionality 
of the dataset with an algorithm’ that accounts for the zero-inflated 


nature of scRNA-seq data (Extended Data Fig. 2f), cells were clustered 
and projected in atwo-dimensional space using either ¢-distributed sto- 
chastic neighbour embedding (t-SNE) or diffusion component mapping 
(Extended Data Fig. 2g-j). We used trajectory inference analysis’*” to 
order cells along a process (that is, pseudotime) and identify cell fates 
in an unsupervised manner. This analysis revealed three trajectories 
(paths 1-3 in Fig. la, Extended Data Fig. 2k). Two of these represented a 
shift from an initial state (grey cluster) toa drug-induced state (path 1, 
red clusters and arrow in Fig. 1a), and then back (path 2, blue clusters 
and arrowin Fig. 1a). The distribution of clusters over treatment time 
(Fig. 1b) suggested that paths 1 and 2 represent inhibited and adapting 
cell states, respectively. 

To test this, we used about 200 KRAS(G12C)-dependent genes to 
derive a KRAS(G12C)-specific transcriptional output score (Methods), 
which was used as an indicator of KRAS signalling across single cells. At 
baseline most cells had high output, as evidenced by their high G12C- 
induced scores (reflecting genes downregulated upon treatment with 
G12Ci) and low G12C-suppressed scores (reflecting genes upregulated 
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Fig. 1| Divergent single-cell fates after conformation-specific KRAS(G12C) 
inhibition. a, Diffusion component (DC) analysis of single cells from models of 
KRAS(G12C) tumours, treated with a G12Ci for 0, 4,24 and 72h. The arrows 
indicate inhibitory trajectories derived by the Slingshot algorithm. b, Cluster 
composition across treatment time. c, The distribution of KRAS(G12C)- 
dependent transcriptional output score across single cells.d, The trend in 
G12C output as a function of pseudotime was established by fitting a spline to 
single-cell data. The 95% confidence interval is shown. n= 4,759, n= 8,653 and 
n=4,050in path 1, path 2 and path 3, respectively; n, number of cells. e, G12C 
output score across clusters. Median, upper and lower quartiles, and outliers 


upon treatment with G12Ci) (Fig. 1c, Extended Data Fig. 3a, b). Treat- 
ment with G12Ci sequestered most cells in a state with low output. 
Some drug-treated cells had high output (arrow in Fig. 1c), indicating 
diverging responses across the population. The change in output score 
revealed different fates along the trajectories (Fig. 1d, e, Extended Data 
Fig. 3c-e): cells in path 1 had inhibited output, and cells in path 2 had 
an initial inhibition followed by reactivation. 

The trajectories correlated with changes in cell-cycle-specific expres- 
sion signatures (Extended Data Fig. 4a). By classifying cells along the 
cell cycle, we found that treatment with G12Ci induced a quiescent 
state (GO) that was transcriptionally distinct from Gl (Fig. 1f, Extended 
Data Fig. 4b-d). This induction mirrored the inhibition of KRAS(G12C) 
output along the trajectories (Fig. 1g). Treatment with G12Ci also led to 
higher levels of protein expression for p21 and p27 (Fig. 1h, Extended 
Data Fig. 4e), two markers of quiescence’. Consistent with this, cell- 
cycle analysis of double-thymidine-synchronized cells showed that the 
G12Citreatment arrests cells in a GO or G1 state (Extended Data Fig. 4f). 

We used a quiescence biosensor” based ona cyclin-dependent-kinase 
binding-defective p27 mutant (p27K ) to monitor the subpopulations 
and to validate the results of the scRNA-seq analysis (Fig. li, Extended 
Data Fig. 5a). As predicted, the G12Ci treatment led to a bimodal cell 
distribution, comprising a subpopulation of quiescent cells with high 
p27K (about 80% of the total population) and a subpopulation of rap- 
idly adapting cells with low p27K (about 20%). This differed from the 
effect of inhibitors that target MEK or ERK, two kinases that are down- 
stream of KRAS(G12C) (Extended Data Fig. 5b, c). Although exposed 
to the G12Ci treatment for the same duration, cells with low p27K had 
higher active KRAS than cells with high p27K (Fig. 1j, Extended Data 
Fig. Sd) and were able to progress past the G1 restriction point (Fig. 1i, 
inset). Rechallenge with the G12Ci attenuated the adapting response 
to some degree, but it could not eliminate this population (Extended 
Data Fig. Sa, e). Furthermore, the inhibition of levels of KRAS-GTP by 
the G12Ci during rechallenge was less than its initial effect (Extended 
Data Fig. 5f). 
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are shown. f, g, Cell-cycle-phase distribution over time (f) or across clusters (g). 
h, Extracts from drug-treated KRAS(G12C)-mutant cells (H358) were analysed 
to determine the expression of the indicated proteins. i, H358 cells expressing 
the quiescence biosensor (mVenus-p27K ) were analysed by fluorescence- 
activated cell sorting (FACS). Inset, cell-cycle distribution of the indicated 
populations. j, Biosensor-expressing cells were treated, sorted and analysed to 
determine the levels of active and total KRAS. A representative of three 
independent experiments is shown inh-j.H, high p27K expression; L, low 
p27K expression; NA, not applicable. 


To identify the adaptive signals that are responsible for the divergent 
response to the G12Ci treatment, we used two orthogonal approaches 
(Methods). A differential expression analysis, which contrasted single 
cells from the inhibited and adapting trajectories, revealed many tran- 
scripts with trajectory-specific expression (Extended Data Fig. 6a). A 
genome-wide knockout screen identified single-guide RNA (SgRNA) 
targets that enhanced the effect of treatment with G12Ci (Extended 
Data Fig. 6b). After integrating the results from both approaches, we 
focused on genes with subpopulation-specific expression that were also 
functionally related to proliferation on treatment (Fig. 2a, b, Extended 
Data Fig. 6c—e). Of these, heparin-binding epidermal growth factor 
(HBEGF), aurora kinase A (AURKA) and KRAS were studied in more detail. 

The expression of HBEGF mRNA—which encodes aligand of the epi- 
dermal growth factor receptor (EGFR)”°—was downregulated shortly 
after treatment with G12Ci, but rebounded at 48-72 h (Extended Data 
Fig. 7a). The single-cell analysis ‘localized’ this rebound to a cluster 
of quiescent cells (Fig. 2a, b, Extended Data Fig. 6d). This change 
was associated with an approximately twofold increase in secreted 
HBEGF during the G12Ci treatment (Extended Data Fig. 7a). Consistent 
with a potential role in mediating adaptation to the G12Ci treatment, 
sgRNAs targeting EGFR signalling intermediates were depleted on 
treatment with G12Ci (Fig. 2c). Likewise, siRNA-mediated knockdown 
of HBEGF enhanced the antiproliferative effect of the drug (Extended 
Data Fig. 7b). 

The secretion of HBEGF could affect a broader population of cellsin 
an autocrine and/or paracrine fashion by activating EGFR, which—simi- 
lar to other receptor tyrosine kinases—drives nucleotide exchange to 
activate RAS”. Indeed, adapting cells had higher levels of EGFR pathway 
activation than quiescent cells (Fig. 2d). Stimulation with recombinant 
EGF induced KRAS activation in sorted quiescent cells and enhanced 
signalling in an EGFR- and SHP2-dependent manner (Extended Data 
Fig. 7c-e). EGF also enhanced the escape from quiescence (Fig. 2e) 
when the ligand was added during the adaptive phase of the G12Ci 
treatment (24-48 h), but not when it was added at the beginning. This 


© HBEGF =| ..R87| 2° 


a Inhibited Adapting + |] CDKNIA 
8 i PTEN 
Custes Q@--> >+0>@>@ o& oi EGFR >sHP2 = 
GRB2:SOS1 = HBEGF 
Fan EGFR 
KRAS KRAS &|GRB2 g 
(GDP) (GTP) ae @ Sire 
vy \ — : 
420 2 
p21 PAF PIS MAGeCK £ score 
RB1 “v 
G1i-s 
d unsorted LH p27K- 
b "ten an F 
N KRAS HBEGF AURKA otadgh NX G12Ci,h 
‘4 4 ; aase pEGFR 
Be 2 1 = ==] EGFR 
8 04 eee 0 ==] [ml pSHP2 
a 0 Rp ane 1 
& 24 Sw [ese RATES: —19 gee eee ==] SHP2 
T T T T | a T T T T a T T T T T a 
e ee e ee e e@e0 f-e=-=-- —=—] B-Actin 
e 9 Control — G12Ci 
f -e—EGFRi —— Combination 
2 Fy 8005 
%@ 1.0 @12Ci,72h ‘1.0 G12Ci, 72h «B49 
2 2 »72N 5 S400 
s +EGFatOh @ +EGFRi 2&8 4109= 
057 | +£GF at 24h gOS sHP2i « § 8 ‘sod ]P=2.008 
ro +EGF at 48h s Fe 04 Fp get 
eB 0 oH 0 -50/ 


or a 
0 7 14 21 28 


2345 2345 
p27K-, log(AU) p27K-, log(AU) Time (d) 


Fig. 2 | Adaptation to the G12Ci treatment is dependent on EGFR signalling. 
a,b, The peak (a) or mean (b) expression of genes with trajectory-specific 
expression. a, Cells from the indicated clusters were projected in their 
diffusion component coordinates. Cells with peak expression in the indicated 
genes are shown in navy. b, Cells were grouped by cluster, ordered in 
pseudotime and the mean expression was calculated for pools of 15 cells (grey) 
or the entire cluster (navy). c, A genome-wide CRISPR-Cas9 screen in H358 cells 
identified EGFR signalling intermediates as potential regulators of the G12Ci 
treatment. NT, non-targeting sgRNAs. SHP2is also known as PTPN11. 

d, Immunoblots of extracts from G12Ci-treated and FACS-sorted H358 p27K" 
cells. e, f, The cells were treated with the G12Ci for 72 halone, inthe presence of 
EGF stimulation at the indicated times (e) or in the presence of the indicated 
EGFR signalling inhibitors (f). g, Mice bearing xenografts of H2122 cells were 
treated as shown, to determine the effect on tumour growth. Mean +s.e.m, 
n=4 mice. Atwo-sided t-test Pvalue is shown. A representative of two 
independent experiments is shown in d-f. 


suggests that EGFR signalling is required for the escape from drug- 
induced quiescence. 

Inhibition of EGFR signalling, either by targeting EGFR or SHP2, 
attenuated the adaptive reactivation of KRAS-GTP in KRAS(G12C)- 
mutant lung cancer cells and ‘RASless mouse embryonic fibroblasts 
(Extended Data Fig. 7f-i). Cotargeting EGFR or SHP2 together with 
KRAS(G12C) attenuated the escape from drug-induced quiescence 
(Fig. 2f) and enhanced the antiproliferative and/or antitumour effect 
(Fig. 2g, Extended Data Fig. 7j). Broad suppression of receptor tyrosine 
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kinases by serum deprivation enhanced the antiproliferative effect of 
G12Ci in several models (Extended Data Fig. 7k), which suggests that 
additional growth factors may contribute in a tumour- or subpopula- 
tion-specific manner. Thus, EGFR signalling coordinates—at least in 
part—the heterogeneous response to the G12Ci treatment. 

Although it is predominantly activated in G2M to control cell division, 
AURKA has also previously been implicated in the regulation of mito- 
genic signalling”? (Fig. 3a) and acquired resistance to EGFR or PI3K 
inhibition”*”’. AURKA was upregulated along the adapting trajectory 
(Extended Data Fig. 6) and expressed at higher levels in sorted adapt- 
ing cells thanin sorted quiescent cells (Fig. 3b). sgRNAs against AURKA 
ranked highly in the screen (Fig. 3a). Individual knockout of AURKA 
augmented the antiproliferative effect of the G12Ci and prevented 
the reactivation of KRAS over time (Fig. 3c, d). Inhibitors targeting 
AURKA* suppressed the reactivation of KRAS-GTP during the G12Ci 
treatment, both in KRAS(G12C)-mutant cancer cells (Extended Data 
Fig. 8a) and RASless mouse embryonic fibroblasts (Extended Data 
Fig. 8b). Doxycycline-inducible AURKA expression not only enhanced 
the adaptive reactivation of KRAS-GTP and CRAF-MEK-ERK signalling 
during the G12Ci treatment, but also attenuated the antiproliferative 
effect of treatment with G12Ci (Extended Data Fig. 8c, d). 

AURKA interacts with wild-type HRAS in non-malignant cells to 
enhance its interaction with CRAF™. Here we found that AURKA also 
interacts with KRAS(G12C) in cancer cells (Extended Data Fig. 8e). Treat- 
ment with G12Ci or an AURKA inhibitor (AURKAi) displaced only CRAF 
or only AURKA, respectively, from KRAS(G12C). Treatment with acom- 
bination of G12Ci and AURKAi displaced both interactions to enhance 
pCRAF and pERK inhibition, relative to treatment with each drug alone 
(Extended Data Fig. 8f, g). This suggests that AURKA complexes with 
KRAS to stabilize its interaction with downstream effector CRAF. As 
expected, targeting AURKA prevented the escape from G12Ci-induced 
quiescence (Fig. 3e). Combined inhibition of AURKA and KRAS(G12C) 
had a synergistic antiproliferative effect across KRAS(G12C)-mutant 
models (Extended Data Fig. 8h) and resulted ina stronger antitumour 
effect in vivo, as compared to G12Ci monotherapy (Fig. 3f, Extended 
Data Fig. 8i,j). Thus, the heterogeneous bypass of G12Ci-induced qui- 
escence also depends on the AURKA-mediated adaptive reactivation 
of KRAS. 

The non-uniform response to the G12Ci treatment may occur 
in a KRAS(G12C)-dependent or KRAS(G12C)-independent manner. 
Unlike treatment with G12Ci, treatment with aKRAS°”“-specific siRNA 
(KRAS°“ siRNA)” yielded a uniform induction of quiescence (Fig. 4a). 
The KRAS“” siRNA differs from the inhibitor in that it suppresses sig- 
nalling in a manner that is not dependent upon conformation, but is 
otherwise susceptible to KRAS(G12C)-independent adaptive pressures. 
The uniform effect of the KRAS“¢ siRNA suggests that reactivation 
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Fig. 3 | AURKA is involved in the adaptive reactivation of KRAS and escape 
from drug-induced quiescence. a, AURKA signalling intermediates identified 
in the CRISPR-Cas9 screen as potential regulators of the response to the G12Ci 
treatment. b, Immunoblots of extracts from FACS-sorted H358 p27K cells. 
c,d, KRAS(G12C)-mutant lung cancer cells (H358) that express anon-targeting 
(NT) sgRNA or AURKA-specific sgRNAs were treated with the G12Ci and 


analysed by immunoblotting (c) or by crystal violet staining (d). e, Cells were 
treated with the indicated inhibitors for 72h and analysed by FACS. f, Mice 
bearing xenografts of H358 cells were treated with the indicated inhibitors to 
determine the effect on tumour growth. Mean +s.e.m,n=4 mice. A two-sided 
t-test Pvalue is shown. A representative of three independent experiments is 
showninb-e. 
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Fig. 4 | Newly synthesized KRAS(G12C) escapes trapping by the drug. a, Cells 
expressing the quiescence biosensor (H358 p27K , KRAS°?“ ) were transfected 
with KRAS-specific siRNAs targeting both wild-type and G12C alleles (siKRAS), 
or only the G12C allele (siG12C), for 72 h and analysed by FACS. The effect of 
a72-hG12Citreatment is shown. Inset, cell extracts were immunoblotted 

and quantified to determine the intensity of KRAS expression and ERK 
phosphorylation. b, Effect of the indicated treatments onKRAS mRNA. 

AcD, actinomycin D. c, Inhibitor-treated cell extracts were analysed by 
immunoblotting. d, e, Normalized KRAS expressionacross single cells asa 


of KRAS(G12C) is sufficient for the divergent response. However, 
drug-bound KRAS(G12C) cannot undergo nucleotide exchange to 
the active state* >. We therefore considered how KRAS(G12C) might 
be reactivated, when almost the entire initial pool is covalently bound 
and inhibited by the drug. 

The G12Ci treatment induced KRAS mRNA and KRAS protein expres- 
sion (Fig. 4b, c, Extended Data Fig. 9a). This induction was heteroge- 
neous across the population (Fig. 2a, b, Extended Data Fig. 6). It was 
inversely proportional to KRAS-RAF-MEK-ERK signalling activity 
(Extended Data Fig. 9a), and was most pronounced in clusters of quies- 
cent cells with maximal inhibition of G12C output (Fig. 4d, e). Inhibiting 
new KRAS synthesis with the transcription inhibitor actinomycin D 
or KRAS-specific siRNAs prevented the KRAS-GTP rebound during 
the G12Ci treatment (Fig. 4b, c, Extended Data Fig. 9b). Conversely, 
doxycycline-induced KRAS(G12C) expression attenuated the effect 
of drug treatment (Fig. 4f, Extended Data Fig. 9c). 

To confirm that KRAS(G12C) is sufficient for the divergent response 
to the G12Ci treatment, we used the KRAS“¢ siRNA to mimic the ini- 
tial inhibitory phase, and doxycycline-induced expression of siRNA- 
resistant KRAS“”“ (siRes-G12C) to mimic the adaptive phase triggered 
by new KRAS(G12C). As evidenced in Fig. 4g and Extended Data Fig. 9d, 
expression of siRes-G12C led to a bimodal distribution in cells trans- 
fected with KRAS©““ siRNA, with about 30% of cells escaping quiescence. 
This phenocopies the effect of the G12Ci, even though no drug was 
added in this experiment. 

The data suggest that some newly synthesized KRAS(G12C) under- 
goes nucleotide exchange to the active, drug-insensitive state before it 
is bound and inhibited by the drug. Indeed, EGF stimulation attenuated 
the inhibition of KRAS(G12C) when EGF was added before the G12Ci, 
but not when it was added afterwards (Extended Data Fig. 9e). This sug- 
gests that exposure to growth factors is the initial stimulus that affects 
the inhibitory fate of cells with new KRAS(G12C). AURKA probably 
operates later, to maintain active KRAS(G12C) and effector signalling. 

Our study thus sheds light on why treatment with a KRAS(G12C) 
inhibitor results predominantly in partial responses in patients with 
lung cancer?°. We identify an adaptive fitness mechanism that allows 
groups of cancer cells within a population to rapidly escape inhibition 
(Extended Data Fig. 10, Supplementary Discussion). The synthesis of new 
KRAS(G12C) and its distribution between the active or inactive states 
modulates the divergent response. Drug-induced quiescent cells with- 
out adequate expression of new KRAS(G12C) are eliminated from the 
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function of KRAS(G12C) output score (d) or in quiescent versus proliferating 
(prolif.) cells (e). CI, confidence interval. f, H358 cells engineered to express 
haemagglutinin (HA)-tagged KRAS(G12C) under a doxycycline (dox) -inducible 
promoter were treated with the G12Ci in the presence of doxycycline 

(0-2 pg mI’). g, Cells expressing the quiescence biosensor, engineered to 
stably express doxycycline-inducible siRes-G12C, were transfected with 
KRAS©”¢ siRNA, followed by doxycycline treatment (100 ng mI’). 
Arepresentative of three independent experiments is shownina-c, f,g. 


population by treatment. Cells with new KRAS(G12C)—which is rapidly 
converted to the active, drug-insensitive state—escape inhibition and 
resume proliferation. This occurs through signals that act upstream 
of KRAS: receptor tyrosine kinases trigger nucleotide exchange, and 
AURK signalling facilitates effector activation and cell-cycle progres- 
sion. In cells in which these signals are not active (or in cells in whichthe 
signals are pharmacologically suppressed), new KRAS(G12C) spends a 
longer time in its inactive conformation, in which it can be bound and 
inhibited by the drug. These cell-intrinsic events are sufficient for a rapid, 
multifactorial and non-uniform adaptive process that limits the thera- 
peutic potential of conformation-specific KRAS(G12C) inhibition. This 
mechanism must be suppressed for complete and durable responses 
in the clinic. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Cell culture and reagents 

Allcell lines used in this study were maintained in DMEM medium supple- 
mented with 10% FBS, penicillin, streptomycin and 2mM L-glutamine. Cell 
lines were obtained from ATCC (H358: CRL-5807; H2122: CRL-5985; and 
SW1573: CRL-2170), expanded immediately and frozen in aliquots. The cell 
lines tested negative for mycoplasma. All experiments were performed 
within 20 passages. The KRAS(G12C) inhibitor (ARS1620) was obtained 
from ChemGood and administered at 10 uM, or as otherwise specified, in 
culture and ata dose of 200 milligrams per kilogram body weight (mpk) 
in mice. The maximum selective concentration for in vitro dosing was 
experimentally determined to be 10 uM. At concentrations greater than 
10 uM, antiproliferative effects were observed in non-KRAS(G12C)-mutant 
cells, such as HEK293 cells. Inhibitors targeting AURKA (alisertib) and 
AURKA, AURKB and AURKAC (tozasertib) were obtained from Selleckem. 
These were administered at 10 pM, or as otherwise specified, in culture 
and at doses of 30 mpktwicea day for alisertib or of 50 mpk for tozasertib, 
in mice. Inhibitors targeting EGFR (gefitinib, 10 pM), pan-HER kinase 
(afatinib, 1 1M), SHP2 (SHPO99, 10 iM), MEK (trametinib, 25 nM), ERK 
(SCH772984, 500 nM), PI3K (BYL719, 1 1M), AKT (MK-2206, 2 uM) or RAL 
(BQUS7, 10 pM) were obtained from Selleckem, and were administered 
inculture at the above concentrations, unless otherwise indicated. Gefi- 
tinib was administered at a dose of 100 mpk in mice. Actinomycin D was 
purchased from Sigma Aldrich and used at a concentration of 2.5 pg/ml. 


Rationale and approach 
The goal of this study was to determine how KRAS(G12C)-mutant can- 
cer cell populations adapt to treatment with a conformation-specific 
KRAS(G12C) inhibitor. These inhibitors bind only to the inactive con- 
formation, and spare the active state of KRAS(G12C). They inactivate 
KRAS(G12C) by trapping the oncoprotein in its inactive state and prevent- 
ing its reactivation by nucleotide exchange. Inhibition occurs because— 
rather than existing in a constitutively active state—KRAS(G12C) 
hydrolyses GTP to GDP and undergoes nucleotide cycling in cancer cells. 
We reasoned that cells across a population do not have synchronized 
KRAS(G12C) nucleotide cycles. Various perturbations (such as drug 
treatment) are likely to diversify the population, altering the distribution 
of cells with predominantly active or inactive KRAS(G12C). On the basis 
of this reasoning, we hypothesized that the adaptive reactivation during 
the G12Ci treatment occurs by shifting the KRAS(G12C) equilibrium to 
the active, drug-insensitive state and reflects anon-uniform behaviour 
of cancer cells, ina manner that is determined by the distribution of the 
nucleotide-bound states of KRAS(G12C) (thatis, the distribution of cells 
with predominantly active or inactive KRAS(G12C) after treatment). 
The cells were treated with the G12Ci for a short time to minimize the 
chance of acquired or selected genomic alterations. Treatment was car- 
ried out in the absence of the tumour microenvironment to minimize 
the potential confounding effect of stromal interactions. scRNA-seq 
was chosen because it allows for: (1) an analysis of thousands of single 
cells to determine phenotypic differences in treatment response; and 
(2) the determination of transcriptional changes across the popula- 
tion. It is well established that only the GTP-bound conformation of 
KRAS activates effector signalling, which in turn leads to changes in 
transcriptional output”. With this in mind, the collective expression 
of KRAS(G12C)-dependent genes was used to infer the activation status 
of KRAS signalling in each single cell. 


scRNA-seq 
Experiment. KRAS(G12C)-mutant tumour cell models (H358, H2122 and 
SW1573) were treated with the KRAS(G12C) inhibitor (ARS1620, 10 pM) 


for 0, 4, 24 and 72 h, followed by rapid collection of attached cells in 
cryopreservation medium. The cells were stored at —80 °C until the 
completion of the experiment. scRNA-seq was carried out using the 
Chromium 10X platform (3-prime v.1), following the manufacturer’s 
protocol, as previously described* **. In brief, single-cell suspensions 
were loaded ona GemCode single-cell instrument to generate single-cell 
gel bead emulsions (GEMs). Each GEM contains sequencing adapters and 
primers, a barcode used to index cells, arandomer or unique molecular 
identifier (UMI) used to count transcripts and an anchored oligo-dT to 
prime polyadenylated RNA transcripts. Cells were loaded at a limiting 
dilution to minimize the co-occurrence of two or more cells in the same 
GEM. The cells were lysed, and their mRNA was reverse-transcribed fol- 
lowed by disruption of the emulsions. Subsequently, barcoded cDNA 
was pooled followed by shearing, end repair and A-tailing, ligation of 
adaptors and another round of PCR amplification to generate samples 
carrying properly oriented adaptors. The libraries were then sequenced 
with HiSeq 2500 in paired-end mode. Alignment, barcode assignment 
and UMI counting were carried out using the Cell Ranger Single-Cell 
Software Suite. Then, sample demultiplexing was performed to gener- 
ate FASTQ files for the 14-bp barcode, the 10-bp UMI tag and the cDNA 
insert. The latter was aligned to the human reference genome using 
STAR. Cell barcodes and UMIs were filtered to ensure correct assignment 
and elimination of mismatches, and PCR duplicates were marked and 
removed. The number of reads that provided meaningful information 
was calculated as the product of four metrics: valid barcodes, valid 
UMI, associated with a cell barcode and confidently mapped to exons. 


Initial processing. The gene-cell count data matrices generated for 
each treatment collection time were consolidated into a single matrix 
representing each tumour cell line and assembled into a SingleCellEx- 
periment object using R and Bioconductor tools®. Anumber of quality- 
control metrics were derived using the scater package*’. Before merging 
the datasets from different KRAS(G12C) models, we performed quality 
control on each model separately. Cells that had a total UMI count, to- 
tal gene count or per cent mitochondrial count greater or lower than 3 
mean absolute s.d. were excluded from the analysis, as these cells may 
represent doublets or cell debris. To account for the possibility that dif- 
ferences in these variables may be treatment-related, filtering was carried 
out individually for each treatment time point. Despite two independent 
experimental replicates leading to sequencing of several thousand single 
cells, no good-quality cells (that is, suitable for downstream analysis) 
were identified from the 24-h treatment time point in H2122 cells. Genes 
expressed only ina small proportion of cells (that is, less than 5% of the 
cells) were excluded from downstream analysis. Once filtered, the data- 
sets from each tumour model were joined together. Only genes detected 
in all three models were included in the joined dataset. Together, this 
filtering resulted in a dataset with 10,177 cells and 8,687 genes, in which 
each cell had an average of about 3,000 expressed transcripts. 


Dimensionality reduction. This was carried out on the merged data- 
sets from ‘Initial processing’, using a zero-inflated negative binomial- 
based wanted variation extraction (ZINB-WaVE) method”. In brief, the 
method fits a model that accounts for dropouts, overdispersion and the 
count nature of the data (suchas those recorded in scRNA-seq experi- 
ments). The model enables both cell-level and gene-level intercepts, 
which serve as global-scaling normalization factors. With this in mind, 
we controlled for both gene-level (that is, per cent dropout) and cell- 
level (that is, batch or tumour of origin, transcript count, mitochondrial 
transcript count and ribosomal transcript count) effects, adding them 
as covariates in the model. The K parameter was chosen empirically, by 
evaluating K=10, 5,3 and 2. K=2 was found to optimally reduce batch 
effects. The effect of the regression is shown in Extended Data Fig. 2a-f. 


Normalization. Size factors were computed by using the scran nor- 
malization method*. In brief, cells were first clustered using the 


quickCluster command. Cells in each cluster were normalized using 
a deconvolution method embedded in the scran package, which im- 
proves normalization accuracy by reducing pooling together cells 
in the same cluster. Only genes with a mean count of greater than 0.1 
across the dataset were used in this process. The size factors generated 
through this approach were then used to compute normalized expres- 
sion values as log-transformed count data (Supplementary Data1-3). 


Modelling technical variance. After normalization, we modelled 
technical noise by fitting a mean-dependent trend to the variances of a 
set of endogenous control genes (for example, ribosomal genes). This 
was carried out as previously described**. Genes with a variability that 
surpassed that expected on technical grounds alone were deemed to 
have a high degree of biological variability. Highly variable genes were 
used to construct two-dimensional projections. 


Two-dimensional projections. To carry out downstream analysis 
and visualize the single cells, we used the two-dimensional output of 
ZINB-WaVE t-SNE or diffusion component analysis”. The latter two 
projections were carried out using commands embedded inthe scatter 
package with default parameters, which included using the top 500 
most-variable genes to carry out the projection. t-SNE was performed 
on low-rank approximations of the two-dimensional output of ZINB- 
WaVE (which controls for the potential confounding factors noted in 
‘Dimensionality reduction’). Initial runs were carried out with default 
parameters. The final projections were generated by using a perplexity 
parameter of 50 and a theta parameter of 0. A local sigma parameter 
was used for the diffusion component projection. 


Clustering. The t-SNE projection was used to establish a distance 
matrix, followed by clustering of cells by density peaks, as previously 
described*°. We then used rho and delta thresholds (200 and 15, respec- 
tively) to determine cluster peaks and assign single cells to these clusters 
(Supplementary Data 2). The performance of the clustering algorithm 
was examined by silhouette-width analysis (Extended Data Fig. 2j), which 
reported a mean width per cluster ranging between 0.18 to 0.52. 


Trajectory inference analysis. The Slingshot algorithm” was used 
to order and project cells into principal curves representing distinct 
trajectories. The arc length along each principal curve represents pseu- 
dotime, a computational parameter denoting progress along a bio- 
logical process“. To establish trajectories associated with KRAS(G12C) 
inhibition, the cells collected from all treatment time points were first 
clustered in the two-dimensional reduced space. The Slingshot algo- 
rithm was then used to reconstruct the clusters into trajectories. The 
algorithm was anchored by indicating a starting cluster—that is, the 
cluster with the greatest number of cells collected at O h (cluster 7). 
The trajectories and end points were otherwise estimated in an unsu- 
pervised manner and are graphically represented in Extended Data 
Fig. 2k. When comparing expression changes between trajectories (for 
example, Fig. 1d, Extended Data Fig. 6c) single cells were grouped by 
trajectory and assigned the pseudotime value corresponding to that 
trajectory. In this comparison, cells before the bifurcation points were 
assigned to both trajectories, and cells after the bifurcation were as- 
signed toa single trajectory (Supplementary Data 4). When comparing 
expression changes between clusters or cell-cycle modes (for example, 
Fig. le, Extended Data Fig. 6d), the cells were grouped by these fac- 
tors and pseudotime was not considered in the differential expression 
model (but was used for visualization, as described in ‘Visualization’). 
In graphs, pseudotime was adjusted across trajectories (range of 0 to 
100) to enable comparisons in expression trends. 


Differential expression. Dropout events in sCRNA-seq experiments 
make analysis of differential gene expression challenging. We used the 
zero-inflated negative binomial model described in ‘Dimensionality 


reduction’ to identify excess zero counts and generate gene- and cell- 
specific weights. These can be used to adapt bulk RNA-seq differen- 
tial gene-expression pipelines for zero-inflated data”. With this mind, 
we applied methods from the limma package** to perform differential 
gene expression, using observational weights to adjust for zero in- 
flation. First, we identified genes with an expression that differed as 
a function of pseudotime between inhibited (path 1) and adapting 
(path 2) trajectories. The cells were grouped by trajectory, and gene 
expression over pseudotime was contrasted between path 1 and path 
2, while controlling for potentially confounding factors (batch, total 
features, total counts, per cent mitochondrial counts and per cent 
ribosomal counts) and correcting for multiple-hypothesis testing. The 
false discovery rates from this test are listed in the br.qval column in 
Supplementary Data 3. A similar analysis was carried out inthe subset 
of cells collected only at 72 h, to determine whether the differences 
persisted in cells collected at the same treatment time. We also ex- 
plored differences in gene expression between cell clusters or cell- 
cycle modes; that is, quiescent (GO) versus proliferating cells (G1S, S, 
G2M, M and MGI). Significant differences were tested using limma*®. 
These results are listed in the cluster.qval and mode.qval columns in 
Supplementary Data 3. Key differentially expressed genes are mapped 
in Extended Data Fig. 6a, d. The false discovery rates corresponding 
to KRAS, HBEGF and AURKA are shown in Extended Data Fig. 6e. Given 
the limitations in performing differential gene-expression analysis in 
scRNA-seq datasets, efforts were made to experimentally validate key 
findings from this analysis. 


Visualization. To visualize KRAS, HBEGF and AURKA, the cells were 
ordered by cluster and pseudotime and then the normalized expression 
values (see ‘Normalization’) were averaged across pools of n adjacent 
cells (n = 15 for the plots in Figs. 2b and 4d or n= 40 for the heat maps 
in Extended Data Fig. 6). In some instances, the last pool per cluster 
contained more than n, but fewer than 2n, cells. This was done in an 
effort to compensate for random dropout events in each single cell 
and to minimize the risk of visualizing technical outliers. The averaged 
expression was scaled across all pools and then presented graphically. 
The projection of peak gene-expression levels into two-dimensional 
coordinates (Fig. 2a) was carried out by identifying and labelling the 
cells with scaled expression equal or greater than 2s.d. from the mean of 
the entire dataset. Projections of expression signature scores (Extended 
Data Fig. 3) were carried out ina similar manner without a cut-off. The 
expression trend for a gene or a gene set was established by fitting a 
spline along with its 95% confidence interval to the single-cell data. 


Bulk RNA sequencing and output score determination 

Bulk RNA-seq. KRAS(G12C)-mutant lung cancer cells were treated 
with the G12Ci for O, 4, 24 and 48 hin biological triplicates. RNA was 
extracted using the RNeasy Mini Kit (Qiagen catalogue no. 74104), 
according to the manufacturer’s instructions. After RiboGreen quan- 
tification and quality control by Agilent BioAnalyzer, 500 ng of total 
RNA per sample underwent polyA selection and TruSeq library prepara- 
tion according to instructions provided by Illumina (TruSeq Stranded 
mRNALT Kit, catalogue no. RS-122-2102), with 8 cycles of PCR. Samples 
were barcoded and run on HiSeq 4000 in a50-bp and 50-bp paired-end 
run, with an average of 30 million paired reads per sample. Ribosomal 
reads represented less than 0.5% of the total reads generated. The se- 
quencing output files from different lanes were concatenated, aligned 
to GRCH38 using HISAT2 and transcripts were counted using HTSeq 
in Python. The count data matrix was then processed in R by using 
limma* and edgeR“ from Bioconductor, as previously described*. 
In brief, the data were filtered by removing transcripts that were not 
detected in all replicates. Size-factor normalization was carried out and 
differential expression analysis was performed contrasting each time 
point to the untreated condition. The count data were transformed to 
log,-transformed counts per million followed by an estimation of the 
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mean-variance relationship, which was then used to compute appropri- 
ate observation-level weight data. The data for each gene were used to 
fit a linear model and to compute various statistical parameters fora 
given set of contrasts. Differentially expressed genes were considered 
to be those with log,-transformed fold change equal or greater than 2 
in any contrast and an adjusted P value of less than 0.05. These genes 
(563 induced, and 447 suppressed, by KRAS(G12C)) comprised the lung- 
cancer-specific KRAS(G12C)-dependent gene-expression signature 
that was used to calculate the G12C output score across single cells 
(Supplementary Data5). 


Output score determination. The list of KRAS(G12C)-dependent genes 
was filtered to remove genes with undetected or very low expression 
in the single-cell dataset (that is, average log count of less than 0.1). 
The G12C output score in each single cell was the average log count 
expression of the remaining KRAS(G12C)-dependent genes (212 genes, 
Supplementary Data 6), normalized across all cells in the dataset. The 
G12C-induced and -suppressed scores were calculated from genes that 
were, respectively, down- or upregulated by the G12Ci treatment in 
the bulk RNA-seq experiment. When not specified, G12C output refers 
to the score derived from genes that were downregulated during the 
G12Ci treatment (these comprised the majority of the KRAS(G12C)- 
dependent genes detected in the scCRNA-seq dataset). The trend in G12C 
output score as a function of pseudotime was visualized by fitting a 
spline and its 95% confidence interval (Fig. 1d). The G12C output score 
in each single cell is shown in Extended Data Fig. 3b-d. The rotation 
gene set test* was used to compare the G12C-induced or -suppressed 
gene sets across pseudotime in path 1 versus path 2 (P= 0.001 for the 
G12C-induced gene set and P= 0.001 for the G12C-suppressed gene set) 
(Fig. 1d). The G12C output score in clusters (Fig. le) was compared by 
using analysis of variance (ANOVA) (P= 0.001) and then in a pairwise 
fashion by using the Tukey test and correcting for multiple hypothesis 
testing (P=0.001 for each of—but not limited to—the following cluster 
comparisons: 7 versus 9, 7 versus 10, and 10 versus any other). The dif- 
ference in G12C output score between quiescent or proliferating cells 
(Extended Data Fig. 5d) was compared by atwo-tailed t-test (P< 0.001, 
in each cell line). Again, these findings were validated experimentally. 

Cell-cycle classification was performed by using published cell- 
cycle-phase-specific gene-expression signatures (GO, GIS, G2M, M 
and MG1)’*. The overlap between the gene sets is shown in Extended 
Data Fig. 4b. These were used to calculate a phase-specific score, ina 
similar manner to that described for the G12C output score, followed 
by scaling the scores across cells and cell-cycle phases. The cells were 
classified by assigning the cell-cycle phase corresponding to the high- 
est scaled value. The scaled values are graphically depicted inthe heat 
map in Extended Data Fig. 4c. 


Identification of candidate genes for further analysis 

The objective of this part of the study was to identify the genes respon- 
sible for the divergent response to treatment. We identified a large 
number of genes that were differentially expressed between the inhib- 
ited and adapting single-cell trajectories. The zero-inflated nature of 
scRNA-seq data makes it difficult to reliably estimate the magnitude 
of changes in expression. With this in mind, in an effort to unbiasedly 
select candidate genes for further validation and analysis we resorted 
toa CRISPR-Cas9 screen. By integrating the results from the screen and 
single-cell analyses, we aimed to identify key genes and/or pathways that 
are differentially expressed between the single-cell trajectories and that 
are also functionally related to proliferation during the G12Ci treatment. 


CRISPR-Cas9 screen 

Library lentivirus production and purification. The Brunello hu- 
man genome sgRNA library” was obtained from the gene editing and 
screening core facility at Memorial Sloan Kettering Cancer Center 
(MSKCC). This library consists of four sgRNAs per gene, optimized 


to reduce off-target effects. To produce the lentivirus expressing the 
sgRNA library, HEK293FT cells were transfected with the DNA library 
using XtremeGene HP (Sigma-Aldrich), according to the manufacturer’s 
instructions. After 24 h, the medium was replaced with fresh complete 
DMEM. Following another 48-h incubation, the medium was collected 
and centrifuged at 500g at 4 °C for 5 min to pellet cell debris. The su- 
pernatant was passed through a 0.45-um filter and was snap-frozen in 
aliquots for future use. 


Establishment of Cas9-expressing cells. Lentivirus expressing 
spCas9 was produced as above and transduced into H358 cells at a 
multiplicity of infection of 1. Cells were sorted into 96-well plates at one 
cell per well and selected for growth in complete medium containing 
10 pg/ml blasticidin. Cas9-expressing cell lines were validated for Cas9 
expression, Cas9 activity and a similar sensitivity to the G12Ci as the 
parental cell line (data not shown). 


Titration of the library-expressing virus. To determine the titre of the 
viral library, Cas9-expressing H358 cells were seeded in15-cm dishes in 
complete medium and transduced with increasing amounts of virus. 
The number of viable cells after puromycin selection was determined 
and used to calculate titre. 


G12Ci screen. Cas9-expressing H358 cells were transduced with the 
sgRNA-library-expressing virus at a multiplicity of infection of 0.3 and 
a coverage of 500x. Puromycin was added 2 days after transduction 
at aconcentration of 2 ug/ml. Eight days after selection (set as to), live 
cells were collected anda portion (40 million cells, which represents a 
500x coverage of the library) was collected for genomic DNA (gDNA) 
extraction (Qiagen Gentra Puregene Cell Kit). The remaining cells were 
reseeded into 150-mm? dishes at 4 million cells per dish and treated 
with DMSO or G12Ci (ARS1620, 10 1M). The medium was changed every 
three days. Cells were passaged at 80% confluence, and at least 500x 
coverage of the library was maintained at each passage. Fourteen days 
after treatment, cells were collected and gDNA was extracted. 


Next-generation sequencing library preparation. The gDNA extract- 
ed in ‘G12Ci screen’ were used to generate next-generation sequenc- 
ing libraries. The sgRNA sequences were amplified and prepared for 
deep sequencing through atwo-step PCR. The first step was performed 
with 10 pg input gDNA ina 100-1 reaction (260 pg gDNA per sample) 
with 1 pl of Herculase polymerase (Agilent). The PCR primers included 
adapters: ATGGACTATCATATGCT TACCGTAACTTGAAAGTATT TCG (v2 
adaptor forward) and CTTTAGTTTGTATGTCTGT TGCTAT TATGTCTACT 
ATTCTTTCC (v2 adaptor reverse). The thermocycling parameters 
were: 95 °C for 5 min, then 95 °C for 20 s; 60 °C for 20 s; 72 °C for 30s 
for 30 cycles, and 72 °C for 3 min. The number of cycles was tested to 
ensure it fell within the linear phase of amplification. Amplicons for 
each sample were pooled and the second step of PCR was performed 
with 8 x100-l reactions containing 5 pl of pooled first PCR amplicons 
to attach Illumina adaptors and indexes with NEBNext High-Fidelity 2x 
PCR Master Mix. The thermocycling parameters were: 98 °C for 3 min, 
then 98 °C for 20s; 68 °C for 20 s; 72 °C for 20s for 12 cycles, and 72 °C 
for 5 min. All parallel PCRs were pooled and purified by one round of 
phenol extraction using PhaseLock tubes (ref 2302820 VWR) according 
tothe manufacturer’s instruction. PCR products were eluted with 30 pl 
of EB buffer and resolved ona 2% agarose gel. Amplicons with desired 
size were purified from the gel with Qiagen’s gel purification kit. Final 
next-generation sequencing amplicons were eluted from column with 
EB buffer and concentrations were determined via Qubit. Samples were 
diluted to 30 nM before sending for sequencing. 


Data analysis. This was carried out using MAGeCK* in Python. MA- 
GeCK uses the beta score to estimate gene dependencies: a positive 
beta indicates that a gene is positively selected, and a negative beta 


indicates that a gene is negatively selected. The fold change in expres- 
sion was also calculated to help interpret and filter the results. This 
approach was used to determine the sgRNAs that are depleted during 
treatment. Gene-specific values were calculated as the mean changein 
expression for the four sgRNAs in the library targeting that gene. The 
non-targeting average was calculated from 1,000 non-specific sgRNAs. 
We prioritized genes with 2 or more sgRNAs that were downregulated 
by 2-or-more-fold in the G12Ci versus t) comparison, and which were 
also identified as having trajectory-specific expression in the scRNA- 
seq analysis. Preference was given to pathways being represented by 
several intermediates. Key findings were validated by independent 
genetic or pharmacological approaches. 


Cloning and plasmids 

sgRNAs targeting AURKA (guide 1: 5’-CCATATAGAAAATAATCCTG and 
guide 2: 5’-CCTGAAAACTCACCGAAGGT) were cloned into lentiGuide- 
Puro vector (a gift from F. Zhang; Addgene plasmid no. 52963) using the 
BsmBl site. The pMXs-IP-mVenus-p27K vector was a gift from T. Kitamura. 
siRNA-resistant HA-KRAS“ (that is, siRes-G12C) construct was generated 
by modifying the siRNA-targeted sequence (5’-G GIG GGT GCA TGC GGA 
GT-3’; the G12C codonis underlined), and the modified HA-KRAS““ was 
synthesized and cloned into the pENTR/D-TOPO vector (Invitrogen). 
The gene was then inserted into the pInducer20 vector (a gift from S, 
Elledge; Addgene plasmid no. 44012) or pLIX_403 vector (a gift from D. 
Root; Addgene plasmid no. 41395) using the Gateway LR Clonase II kit 
(Invitrogen). AURKA was amplified with primers (forward 5’-CACCATGG 
ACCGATCTAAAGAAAACT; reverse 5’-CTAAGACTGTTTGCTAGCTG) from 
the template pAuroraA-GFP-AURKA-mCherry (a gift from M. Tramier; 
Addgene plasmid no. 99878) and cloned into pInducer20 vector. 


Virus production and generation of stable cell lines 

HEK293T cells were seeded at 90% density in a 10-cm dish and trans- 
fected with the expression vector and packaging vectors pMD2.G and 
psPAX2 using Lipofectamine 3000 (Invitrogen) according to the manu- 
facturer’s instructions. Conditioned medium containing recombinant 
viruses was collected and filtered through 0.45-ym filters (Millipore). 
The virus-containing medium was added to cells with 8 pg/ml Polybrene 
(Millipore) overnight. Approximately 24 h after infection, the cells were 
selected with 2 pg/ml puromycin or 500 ng/ml G418. 


Flow cytometry 

Quiescence biosensor distribution. Three million H358 cells express- 
ing the mVenus-p27K’ biosensor” were plated in 6-cm dishes and 
treated with the indicated inhibitors for 72 h. For siRNA experiments, 
5 x 10° cells were seeded in 6-cm dishes and transfected with 25 nM 
KRAS@” siRNA (Sense: GUU GGA GCU UGU GGC GUA G-dTdT; antisense: 
CUA CGC CAC AAG CUC CAA C -dTdT; underline highlights the G12C 
codon) using Lipofectamine RNAiMAX (Invitrogen). Doxycycline was 
added to a final concentration of 100 ng/ml and cells were incubated 
for 72 h. Cells were collected with TrypLE Express (Gibco) and fixed 
for 10 min at room temperature with 4% paraformaldehyde. Samples 
were washed twice with PBS and filtered through 35-um mesh-capped 
collection tubes. If cell-cycle analysis was performed, cells were sub- 
sequently treated with 70% ethanol and stained with 5 pg/ml DAPI. 
Cells were analysed on BD Biosciences LSR Fortessa. Forward and side 
scatter plots were used to exclude debris, damaged cells and doublets. 
Data analysis was performed with FCS Express 6 Flow and GraphPad 
Prism 7.0 software. Where indicated, the p27K biosensor distribution 
was subjected to minimum and maximum normalization in Prism, to 
compare between different conditions. Cell sorting was performed on 
unfixed cells on BD Biosciences FACSAriall. Post-sort purity was 95%+ 
and was determined by running a small aliquot of the collected sample. 


Synchronization experiments. H358 cells were seeded at low den- 
sity (10-20%) and incubated with 2 mM thymidine for 22 h. Next, the 


cells were released in complete medium supplemented with 25 uM 
2’-deoxycytidine for 10 h, followed by another 20 h incubation with 
2mM thymidine. Following synchronization at the G1-S boundary, cells 
were released in complete medium containing either DMSO or G12Ci 
(10 uM), and collected for analysis at the indicated times. Cells were 
fixed in 70% ethanol, washed twice with PBS and stained in propidium 
iodide staining solution (PBS with 100 pg/ml RNase and 50 pg/ml pro- 
pidium iodide). Samples were analysed as in ‘Quiescence biosensor 
distribution’. Cell-cycle distribution was determined using the Multi- 
cycle DNA function embedded in FCS Express 6 Flow. 


Clonogenic assay 

The cells were seeded in 6-well plates at densities of 4 x 10° or 1x 10° cells 
per well and treated as indicated for 6 or 12 days, respectively. Media 
was changed every 3 days. Cells were fixed with ice-cold methanol for 
10 min and stained with 0.5% crystal violet solution for 30 min at room 
temperature on shaker. Plates were washed thoroughly and scanned. 


KRAS(G12C)-expressing RASless mouse embryonic fibroblasts 
RASless (that is, NRAS” HRAS “, Lox-KRAS-Lox, 4HT-cre) mouse embry- 
onic fibroblasts were generated and provided by M. Barbacid”. These 
were infected with a retrovirus expressing HA-tagged KRAS(G12C) 
(generated from a pBABEPuro DNA vector) followed by puromycin 
selection to establish mouse embryonic fibroblasts expressing exog- 
enous KRAS(G12C) and endogenous wild-type KRAS. The cells were 
then treated with 4-hydroxy-tamoxifen for one week to ablate the 
endogenous KRAS allele. The latter were used to determine the effect 
of various treatments on KRAS(G12C) reactivation. 


RAS activation assay 

These assays were performed as previously described” using the 
Active Ras Pull-Down and Detection Kit (Thermo Fisher Scientific). In 
brief, whole-cell lysates were incubated with GST-RAF1 RAS-binding 
domain (RBD) and glutathione agarose resin for 1h at 4 °C, followed 
by 3 washes and elution with SDS-PAGE loading buffer. The samples 
were then subjected to western blotting with a KRAS-specific antibody. 
When HA-KRAS(G12C) was exogenously expressed, an HA-specific 
antibody enabled specific determination of KRAS(G12C) in its GTP- 
bound conformation. 


Immunoprecipitation 

H358 doxycycline-inducible HA-KRAS(G12C) cells were plated in 6-cm 
dishes at 50% confluence. After induction with 2 pg/ml of doxycycline 
for 24h, cells were treated with G12Ci or AURKAi or the combination for 
another 2 h. Cells were then collected and 200 pg of protein lysate was 
used to immunoprecipitate HA-KRAS(G12C) with Pierce Anti-HA mag- 
netic beads according to the manufacturer’s protocol (Thermo Scientific 
no. 88836). Beads were eluted with SDS-PAGE loading buffer and the 
immunoprecipitation product was subjected to western blotting analysis. 


Immunoblotting 

Immunoblotting was performed as previously described*? *!, with the 
antibodies listed in Supplementary Table 1. All antibodies have been 
validated either in our previous work (KRAS, HA-tag, pEGFR, EGFR, pCRAF, 
CRAF, pERK, ERK, pRSK, RSK and GAPDH) or in other publications (B-actin, 
pSHP2, SHP2, pAURKA, AURKA, PLK1, CCNBI1, cleaved PARP, p21and p27). 


Cell viability assay 

Cells were seeded in 96-well plates at 2 x 10° cells per well in (at mini- 
mum) triplicates and treated with the indicated concentrations of 
drugs. After 72 h, cell viability was assayed by CellTiter-Glo Lumines- 
cent Cell Viability Assay (Promega). For siRNA experiments, cells were 
reverse-transfected with 25 nM siRNA for 3 days before the addition of 
drug. siRNAs targeting HBEGF and KRAS are ON-TARGETplus Smartpool 
siRNA from Dharmacon, and the non-targeting siRNA (control siRNA-A) 
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is from Santa Cruz Biotechnology. Data analysis was performed with 
GraphPad Prism 7.0. 


Synergy determination 

To determine the presence of synergy between two drug treatments, 
cells were treated with increasing concentrations of either drug for 
72h, followed by determination of viable cells using the CellTiter-Glo 
Luminescent Cell Viability Assay (Promega). The experiment was car- 
ried out in biological triplicates. The data were expressed as percent- 
age inhibition relative to baseline, and the presence of synergy was 
determined by the Bliss method using the synergy finder R package™. 


Quantitative RT-PCR 

Cells were treated as shown and total RNA was extracted using the Quick- 
RNA Miniprep Kit (Zymo Research). One microgram of RNA was used for 
cDNA synthesis using the High-Capacity cDNA Reverse Transcription 
Kit (Applied Biosystems). qPCR reactions were performed using the 
Applied Biosystem PowerUP SYBR Green Master Mix (Applied Bio- 
systems) on the QuantStudio 6 Flex Real-Time PCR System (Applied 
Biosystems). Samples were run in triplicate, and MRNA levels were 
normalized to ACTB. Primer sequences are: KRAS (forward, GGACTGG 
GGAGGGCTTTCT; reverse, GCCTGTTTTGTGTCTACTGTTCT), HBEGF 
(forward, ATCGTGGGGCTTCTCATGTTT; reverse, TTAGTCATGCCC 
AACTTCACTTT) and ACTB (forward, CATGTACGTTGCTATCCAGGC; 
reverse, CTCCT TAATGTCACGCACGAT). 


Mouse studies 

These studies were carried out as previously described*°™.. In brief, 6-to- 
8-week-old female nu/nu athymic mice were obtained from the Envigo 
Laboratories and maintained in compliance with Institutional Animal 
Care and Use Committee (IACUC) guidelines under protocol 18-05-007 
approved by MSKCC IACUC. The maximum tumour measurement per- 
mitted was 1.5 cm, and this was not exceeded in any of our experiments. 
Mice implanted with xenografts were chosen for efficacy studies inan 
unbiased manner. Once tumours reached 100-mm? volume, mice were 
randomized and treated with drug or the appropriate vehicle control. 
Treatments and tumour measurements were performed as previously 
described*™ in a non-blinded manner by aresearch technician who was 
not aware of the objectives of the study. Prism (GraphPad Software) 
was used for data analysis. For each study arm, the fractional differ- 
ence intumour growth relative to ¢, was plotted over time. Statistically 
significant differences were determined for each treatment time point 
by using the two-tailed t-test function embedded in Prism. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The data that support the findings of this study are available within 
the paper and its Supplementary Information files. Source Data for 
Figs. 1-4 and Extended Data Figs. 2-9 are provided with the paper. The 
scRNA-seq data have been deposited in the Gene Expression Omnibus 
(GEO) with the accession code GSE137912. Data or other materials are 
available from the corresponding author upon reasonable request. 


Code availability 


The analysis was performed using standard protocols with previously 
described computational tools. The scripts, along with the processed 
files described in the Methods, are available from the corresponding 
author upon reasonable request. 
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Extended Data Fig. 1| The effect of the G12Ci treatment on KRAS signalling 
across lung cancer cell lines. The indicated models were treated with 
increasing concentrations of the G12Ci (ARS1620) for 2h (top panels) or with 
10 uM over time (bottom panels), and immunoblotted to determine the effect 


onKRAS signalling intermediates. Key genetic alterations found at baselinein 
the KRAS(G12C)-mutant cell lines used in this study are listed. A representative 
of two independent experiments for each cell line is shown. 
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Extended Data Fig. 2| Quality assessment and processing of sCRNA-seq 
data.a,b, Gene counts asa function of UMI count. Cells are grouped by length 
of G12Ci treatment (a) or tumour model (b). c, The number of cells expressing a 
gene, asa function ofits average count across the dataset. d, Variance asa 
function of mean expression. Technical variance (that is, variability attributed 
to technical factors) was calculated by the expression of ribosomal genes. 
n=10,177 single cells ina-d.e, The per cent of variance explained by various 
experimental factors. Anumber of variables had a meaningful contribution to 
the variance of the dataset (that is, they accounted for greater than 1% of the 
variation), suggesting the need to correct for these potentially confounding 
factors in downstream analysis. f, Dimensionality reduction and covariate 
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regression using the ZINB-WaVE algorithm. The K parameter of 2 was chosen, as 
this minimizes batch and other covariate effects. g, t-SNE projection showing 
single cells coloured by length of inhibitor treatment. h, Parameters used to 
cluster cells by using the Density Cluster algorithm. i, Cluster distributionin 
the indicated projections (top) and cell line composition of each cluster 
(bottom), showing a similar representation of cells from different tumour 
models in each cluster. j, Silhouette-width analysis to assess the 
appropriateness of clustering. Negative values indicate cells that have been 
inappropriately assigned. k, t-SNE projection of KRAS(G12C) single cells with 
the three inhibitory trajectories identified by the Slingshot algorithm. 
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Extended Data Fig. 4| G12Ci treatment induces quiescenceina 
subpopulation of cancer cells. a, Single cells were analysed to determine 
gene-expression signatures that correlated with the inhibitory fates. The top 
20 signatures in each direction are shown. b, The overlap in the cycle-specific 
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treatment time and tumour models. e, The cell lines were treated as shown, to 
determine the level of p27 expression by immunoblotting. A representative of 
two independent experiments is shown for all except H2030, which was 
assayed once. f, KRAS(G12C)-mutant cells (H358) were synchronized with 
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expression (c).d, Comparison of the KRAS(G12C) output score between baseline versus 4-h G12Ci and 72-h G12Ci versus 72-h+ 4-h G12Ci. 
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Extended Data Fig. 6 | Genes with trajectory-specific expression profiles. 
a, Single cells were analysed to identify differentially expressed genes by 
contrasting paths 1and 2. The top 50 genes are shown. The teal dots indicate 
genes that were validated in subsequent experiments. b, ACRISPR-Cas9 
screen was carried out in H358 cells to help to narrow down the list of genes 
with trajectory-specific expression (by identifying and focusing on genes 
modulating the antiproliferative effect of the G12Ci). The schematic is not 
drawn to scale. Preference was given to genes with two or more sgRNAs that 
were downregulated by at least twofold in the G12Ci versus ¢, comparison and 


that were also identified as having trajectory-specific expressionin the scRNA- 


seq analysis. Pathways with several intermediates represented were 
prioritized. The number of gene-specific sgRNAs that were depleted during 
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G12Ci treatment is also shown. NT, non-targeting control.c, Thetrendin 
expression for the indicated genes as a function of pseudotime was established 
by fitting a spline to single-cell data. The 95% confidence interval is shown. The 
pseudotime was adjusted to compare between trajectories. d, The expression 
of the indicated genes in proliferating or quiescent cells. Only cells collected 
during the adaptive phase (24-72 h) of G12Ci treatment are shown. e, The gene 
false discovery rate (FDR) inthe indicated comparisons across either the entire 
cohort of cells or the subset of cells collected at the 72-h time point only 
(n=4,759 single cells in path 1, n=8,653 single cells in path 2,n=4,050 single 
cellsin path3,n=6,599 single cells in G1S,S,G2M,M or MG1 (proliferating) and 
n=3,578 single cells in GO (quiescent). 
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Extended Data Fig. 7 | The adaptive reactivation of KRAS during G12Ci 
treatment is dependent on EGFR signalling. a, The cells were treated with the 
G12Ci over time to determine the effect on HBEGF expression. mRNA 
expression was determined by scRNA-seq (mean, n>1,000 single cells per time 
point, see Fig. 1b) or by quantitative PCR (qPCR, mean+s.e.m.,n=3). The 
amount of protein secreted in the medium was quantified by enzyme-linked 
immunosorbent assay (mean +s.e.m.,n=3). Norm., normalized (minimum- 
maximum). b, Cells transfected with HBEGF-specific siRNAs were treated with 
increasing concentrations of G12Ci for 72 h to determine the effect on viability 
(meants.e.m.,n=5).c, Cells treated with the G12Ci for 72 h were stimulated 
with EGF for 10 min, alone or in combination with the indicated inhibitors. 
Quiescent cells (p27K high) were isolated by FACS and their extracts were 
assayed for active KRAS by RBD pull-down. Immunoblots were quantified 

by densitometry and reported as fold change relative to unstimulated. 

d,e, Untreated or G12Ci-treated (24 h) H358 cells were stimulated with EGF 
(200 ng mI) for 10 min alone or in combination with the indicated inhibitors. 
Cell extracts were analysed by immunoblotting (d). The effect of EGF 
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stimulation at baseline (lanes 2-4 versus lane 1) or after G12Ci treatment (lanes 
6-8 versus lane 5) was quantified by densitometry (e). f-i, The indicated 
KRAS©?¢-mutant lung cancer cells (f, g) or HA-KRAS(G12C)-expressing RASless 
mouse embryonic fibroblasts (h, i) were treated with the G12Ci alone or in 
combination with EGFR or SHP2 inhibitors, as shown. Cell extracts were 
subjected to RBD pull-down to determine the level of active (GTP-bound) 

and total KRAS. The HA tag was used to determine the specific effect on 
KRAS(G12C) (h, i).j, H358 cells were treated with the G12Ci alongside the EGFR 
inhibitor gefitinib (EGFRi), the pan-HER inhibitor afatinib (panHERi) or the 
SHP2 inhibitor SHPO99 (SHP2i) to determine the effect on cancer cell growth 
(top) and the presence of treatment synergy (bottom), by using the Bliss index. 
Red denotes synergy. The mean of three biological replicates is shown ontop. 
k, The indicated KRAS(G12C)-mutant cells were treated with increasing 
concentration of the G12Ci in the presence of 10%, 2% or 0% serum to determine 
the effect on cell viability (mean +s.e.m.,n=3).Arepresentative of two 
independent experiments is shown ind, f-i. Unless otherwise indicated, 
ndenotes biological replicates. 
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Extended Data Fig. 8 | AURK Ais involved in the reactivation of KRAS-GTP 
during G12Citreatment. a, KRAS(G12C)-mutant lung cancer cells were treated 
with the G12Ci alone or in combination with AURKA inhibitor (AURKAi, 
alisertib, 10 pM) or panAURK inhibitor (tozasertib, 10 pM) to determine the 
effect on KRAS-GTP levels over time. There is no effect on KRAS-GTP levels 
with the AURKAi treatment in the absence of the G12Ci treatment. b, RASless 
mouse embryonic fibroblasts expressing KRAS(G12C) were treated as shown 
with the indicated concentrations of AURKAi (1M) to determine the effect on 
KRAS(G12C)-GTP. c, H358 cells stably transfected with doxycycline-inducible 
AURKA (dox AURKA) were treated with the G12Ci in the presence or absence of 
doxycycline (2 pg mI"). Extracts from cells were analysed by immunoblotting 
to determine the effect on the indicated intermediates. d, H358 doxycycline- 
inducible AURKA cells were treated as shown and assayed to determine the 


effect oncell viability (mean+s.e.m,n=5).A two-tailed t-test Pvalueis shown. 
e-g, H358 cells stably expressing HA-tagged KRAS G12C under a doxycycline- 
inducible promoter were treated with doxycycline for 24 h alone (e) or with the 
indicated inhibitors (f, g). Cell extracts were immunoprecipitated and 
immunoblotted as indicated. h, KRAS(G12C)-mutant cell lines were treated as 
shown to determine the effect on cancer cell growth (top) and the presence of 
treatment synergy (bottom), by using the Bliss index. Red denotes synergy. The 
mean of three biological replicates is shown on top. i,j, Mice bearing SW1573 (i) 
or H2122 (j) xenografts were treated with the indicated inhibitors to determine 
the effect on tumour growth (mean + s.e.m, n= 6 in SW1573, n=5inH2122).A 
two-tailed t-test Pvalue is shown. A representative of at least two independent 
experiments is shown ina-g. Unless otherwise indicated, n denotes biological 
replicates. 
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Extended Data Fig. 9 | Inhibition of MAPK signalling stimulates new KRAS 
synthesis. a, The cells were treated with the indicated inhibitors and analysed 
to determine the level of KRAS MRNA or KRAS protein expression 

(mean +s.e.m.,n=3).LFC, log,-transformed fold change, relative to 0h. The 
indicated Pvalues were determined by ANOVA (P= 0.001) followed by pairwise 
comparisons versus baseline, while correcting for multiple hypotheses (using 
Dunnett’s test in Prism). b, SW1573 (KRAS°”?") cells were transfected with non- 
targeting (NT) or KRAS-specific siRNAs followed by treatment with the G12Ci 
and immunoblotting. c, H358 cells engineered to express HA-KRAS(G12C) 
under a doxycycline-inducible promoter were treated with the G12Ci, alone or 
inthe presence of doxycycline, to determine the effect on cell viability at 72h 
(mean +s.e.m.,n=3).d,H358 p27K cells were stably transfected with 


NT siG12C 


doxycycline-inducible siRes-G12C. The cells were transfected with KRAS©”© 
siRNA (siG12C) followed by doxycycline (2 pg ml”) induction. The effect on cell 
viability is shown as mean ¢+s.e.m. (n=5 without doxycycline, n=4 with 
doxycycline). A two-tailed t-test Pvalue is shown. e, H358 cells with 
doxycycline-inducible HA-KRAS(G12C) were treated with doxycycline 

(2 ug mI’) for 24 hin serum-free medium. Then, the cells were exposed to either 
EGF (200 ng mI*) followed by the G12Ci (10 1M), or vice versa. Cell extracts were 
analysed by RBD pull-down and immunoblotting. The specific effect on 
KRAS(G12C) was determined by the HA tag. A representative of at least two 
independent experiments is shown inb, d, e. Unless otherwise indicated, 
ndenotes biological replicates. 
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output, bypassing drug-induced quiescence to resume proliferation. 
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Data exclusions Single cells were excluded from analysis based on pre-established criteria indicating poor quality of sequencing or doublet reads as indicated 
in methods. FACS experiments were gated to analyze only live cells, others were omitted. 


Replication The experimental replicates are indicsted in the figure legends. All stated replicates indicate either biological replicates (e.g. viability studies, 
tumor growth measurements) or independent experimental repeats (e.g. western blotting, biosensor studies) 


Randomization — In vivo tumor growth experiments: implanted mice were treated in a random manner with vehicle, single agent or drug combinations. 
Otherwise not applicable. 


Blinding Tumor growth measurements were carried out in a non blinded manner by a technician not aware of the goals of the experiment. Otherwise 
not applicable. 
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Cell line source(s) ATCC 
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Mycoplasma contamination The cell lines used in the study tested negative for mycoplasma. 
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Methodology 
Sample preparation Cells were harvested with TrypLE Express (Gibco) and fixed for 10 min at room temperature with 4% paraformaldehyde. Samples 
were washed twice with PBS and filtered through 35m mesh-caped collection tubes. 
Instrument BD LSRFortessa 
Software FCS Express 6 Flow and GraphPad Prism 7.0a software 


Cell population abundance Attached cells were assayed to determine the distribution of the GO biosensor. The entire distribution is shown. 


Gating strategy Forward and reverse scatter parameters were used for gating. The distribution of the entire cell population is shown for the GO 
biosensor. See methods. 
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The organization of genomic DNA into nucleosomes profoundly affects all DNA- 
related processes in eukaryotes. The histone chaperone known as ‘facilitates 
chromatin transcription’ (FACT’) (consisting of subunits SPT16 and SSRP1) 
promotes both disassembly and reassembly of nucleosomes during gene 
transcription, DNA replication and DNA repair’. However, the mechanism by which 
FACT causes these opposing outcomes is unknown. Here we report two 
cryo-electron-microscopic structures of human FACT in complex with partially 
assembled subnucleosomes, with supporting biochemical and hydrogen- 
deuterium exchange data. We find that FACT is engaged in extensive interactions 
with nucleosomal DNA and all histone variants. The large DNA-binding surface on 
FACT appears to be protected by the carboxy-terminal domains of both of its 
subunits, and this inhibition is released by interaction with H2A-H2B, allowing 
FACT-H2A-H2B to dock onto a complex containing DNA and histones H3 and H4 
(ref. 3). SPT16 binds nucleosomal DNA and tethers H2A-H2B through its carboxy- 
terminal domain by acting as a placeholder for DNA. SSRP1 also contributes to DNA 
binding, and can assume two conformations, depending on whether a second 
H2A-H2B dimer is present. Our data suggest a compelling mechanism for how FACT 
maintains chromatin integrity during polymerase passage, by facilitating removal 
of the H2A-H2B dimer, stabilizing intermediate subnucleosomal states and 
promoting nucleosome reassembly. Our findings reconcile discrepancies regarding 
the many roles of FACT and underscore the dynamic interactions between histone 


chaperones and nucleosomes. 


Nucleosomes are the basic repeating structural units of eukaryotic 
chromatin, consisting of two H2A—H2B dimers and one (H3-H4), 
tetramer, wrapped by 147 base pairs (bp) of DNA*. They represent 
formidable barriers to the transcription and replication machinery, 
and therefore must be at least partially dismantled ahead of polymer- 
ases and reassembled in their wake, to maintain the structural and 
regulatory functions of chromatin’. Histone chaperones—a diverse 
group of structurally unrelated proteins—facilitate these processes®. 
FACT (Fig. 1a) is an essential histone chaperone?’”® (reviewed in ref. ”) 
that in vitro promotes nucleosome assembly by tethering histone 
and DNA components, and also facilitates nucleosome disassembly 
by displacing H2A-H2B and promoting the partial unravelling 
of DNA?*”, Although the structure of each individual domain in 
both FACT subunits (SPT16 and SSRP1) is known’, the structure of 
the intact FACT heterodimer alone or with its relevant substrate(s) 
has been elusive, and insight into its mechanism is therefore largely 
indirect. 


FACT-nucleosome complex resembles a unicycle 


An intermediate state of FACT-mediated nucleosome assembly has 
previously been identified’. This complex can be biochemically recon- 
stituted by combining 79 bp of the ‘601’ DNA prebound to a (H3-H4), 
tetramer (‘tetrasome’), with FACT prebound to H2A-H2B (Fig. 1b). Dur- 
ing grid preparation for our present cryo-electron-microscopy (cryo- 
EM) studies, very few particles of this intermediate complex escaped 
being damaged by the water-air interface in conventional blotting 
and plunge-freezing procedures. However, adding the zwitterionic 
detergent CHAPS, followed by sample aspiration and rapid freezing 
using Chameleon, a commerical version of ‘Spotiton™, resulted in 
well-dispersed particles (Extended Data Fig. 1a). 

Through extensive three-dimensional classification (Extended 
Data Figs. 1-3 and Extended Data Table 1), we obtained two 
maps at 4.9 A (complex 1) and 7.4 A (complex 2) resolution, which on 
first inspection differ in the number of H2ZA—-H2B dimers bound to 


‘Department of Biochemistry, University of Colorado at Boulder, Boulder, CO, USA. Department of Chemistry and Biochemistry, The University of Texas at Dallas, Dallas, TX, USA. *National 
Resource for Automated Molecular Microscopy Simons Electron Microscopy Center, New York Structural Biology Center, New York, NY, USA. “Department of Physiology and Cellular 
Biophysics, Columbia University, New York, NY, USA. °Biochemistry and Molecular Biophysics Department, Columbia University Medical Center, New York, NY, USA. Howard Hughes Medical 
Institute, Chevy Chase, MD, USA. ’Present address: Department of Physiology and Cellular Biophysics, Columbia University, New York, NY, USA. ®Present address: Biochemistry and Molecular 
Biophysics Department, Columbia University Medical Center, New York, NY, USA. °These authors contributed equally: Yang Liu, Keda Zhou. *e-mail: karolin.luger@colorado.edu 


426 | Nature | Vol577 | 16 January 2020 


c 


i ona (J nea Mixes (Jus (i Hs BBE] ssrei eovop) SPT16 (MD/DD) 


Class 1 (complex 1) 


Fig.1| FACT forms two complexes with a partially assembled nucleosome. 

a, Domain structure of the two FACT subunits, SPT16 and SSRP1. DD, 
dimerization domain; CTD, C-terminal domain; HMG, HMG-box domain; IDD, 
intrinsically disordered domain; NTD, N-terminal domain; MD, middle domain. 
Domains for which no density is visible are shown in white with a dashed 
outline. b, Biochemical characterization of FACT-subnucleosome complexes 
by native PAGE and SDS-PAGE, stained by SYBR gold and Blazin blue, 


the (H3-H4),-DNA-FACT complex (Fig. Ic, d). In both structures, 
histones bind DNA and each other ina manner that is identical to their 
interactions ina complete nucleosome’. Although we used full-length 
FACT, we observed no density for the amino-terminal domain (NTD) 
of SPT16 and several of the C-terminal domains (CTDs) of SSRP1 
(Fig. 1a). 

The overall architecture of both structures resembles a unicycle, 
consisting of a saddle and fork formed by the FACT heterodimer, a 
wheel (the tetrasome) and one or two pedals (H2A-H2B dimers) (Fig. Ic, 
dand Supplementary Video 1). The SPT16 dimerization domain strad- 
dles nucleosomal DNA at the dyad, and the SSRP1 dimerization domain 
stacks on top of the SPT16 dimerization domain, together forming 
the saddle. The SPT16 and SSRP1 middle domains project down on 
either side of the tetrasome wheel and interact with DNA and histones, 
forming the fork. On the SPT16 side, one H2A—H2B dimer pedal is 
docked onto the tetrasome through a nucleosomal four-helix bundle 
arrangement between H2B and H4, and the SPT16 CTD wraps around the 
exposed H2A-H2B DNA-binding surface. The SSRP1 middle domainis 
located on the other side of the tetrasome wheel. Although no H2A-H2B 
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respectively. This experiment was repeated more than ten times with similar 
results. c, Ribbon diagram of DNA, histones and FACT domains fit into class1 
(top) and class 2 (bottom) electron-density maps (80% transparency) 

(see Methods for definitions of classes). d, Schematic of subnucleosomal 
particles and FACT (top row), and of the two complexes inthe same orientations 
asinc (bottom two rows). 


is present on the SSRP1 side in complex 1, we observe clear electron 
density for asecond H2A-H2B dimer in complex 2, comprising 20-30% 
of the intact particles (Fig. 1c, bottom). This second H2A-H2B dimer is 
also engaged with the tetrasome in the same manner as ina complete 
nucleosome, despite the absence of DNA, andisin close proximity with 
the rearranged SSRP1 middle domain. The presence of two complexes 
is consistent with biochemical data’. 


FACT has an extensive DNA-binding surface 


The extensive interaction of both FACT subunits with nucleosomal 
DNAis surprising, as in absence of histones FACT (in its phosphorylated 
form) does not bind DNA (see below). We find that approximately 19 bp 
of DNA are contacted by FACT (Fig. 2a), burying acombined surface area 
of roughly 2,300 A’. Elements from the SPT16 dimerization domain and 
both middle domains contribute to DNA binding. The inner surface of 
the FACT saddle and fork is positively charged, and the tips of the fork 
are negatively charged, promoting FACT interactions with DNA and 
histones in the configuration of the tetrasome (Fig. 2a). 
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Fig. 2| FACT makes extensive contacts with tetrasome DNA. a, Surface 
potential of the FACT-subnucleosome complex. Colouring changes from red 
for-5kT e'to blue for +5 kT e”, generated with APBS Tools 2.1 (PyMOL 2.3.2). 
Histones are omitted from the top centre and right panels. b, Regions of 
significant (P< 0.01) changes in average HDX (AHDX, fromc, d) are mapped 
onto complex 1. Notable regions are coloured according to domain location 
(Fig. 1a). Regions with no detectable change are in white; regions for whichno 
peptides were recovered are in grey. The grey arrow denotes protected regions 
at the intersubunit interface. The subnucleosome is coloured wheat. 
Representative uptake plots are listed in Extended Data Fig. 5.c, Volcano plot 
comparing the average AHDX of SPT16 peptides in FACT and in FACT bound to 
subnucleosomesat all time points. Each time point was collected in triplicate 
(n=3); the Welch’s t-test was one-sided. d, Volcano plot comparing average 
AHDxX of SSRP1 peptides in FACT and in FACT bound to subnucleosomesat all 


Our map does notallow us to assign amino acids in FACT that directly 
participate in subnucleosome binding. To validate our structures, 
we used hydrogen-deuterium exchange (HDX) coupled to mass spec- 
trometry, comparing deuterium uptake in FACT alone and in FACT 
bound to subnucleosomes (Extended Data Figs. 4, 5). On the basis of 
volcano plot analysis”, we find a significant decrease in deuterium 
uptake (with a AHDX of more than 0.3 Da; P< 0.01) in regions of the 
dimerization and middle domains of SPT16 and SSRP1. These regions 
map primarily to the inner face of the fork that interfaces directly with 
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time points. Each time point was collected in triplicate (n= 3) and the Welch's t- 
test was one-sided. For c,d, dotted lines show significance cut-offs of A average 
HDX >0.3 Daand P<0.01 froma Welch’s t-test. Insignificant peptides are in 
light grey, and significant peptides are coloured according to their domain 
location and listed. e, DNA binding by FACT CTD deletions. We incubated 

400 nM FACT or deletion constructs with 100 nM Atto-647-labelled 30-bp DNA, 
and analysed the results by native PAGE, visualized by DNA fluorescence. This 
was repeated twice with similar results, and multiple times with DNA of 
different lengths. AC, ACTD; FL, full-length. f, Fluorescence polarization assay 
of FACT (0-2,500 nM) with 20 nM Alexa-488-labelled tetrasome. Dissociation 
constants (K,) (innM): full-length FACT (magenta) 204.2 +19 (R?= 0.9723); 
SPT16AC-SSRPI (blue) 184.8 + 14 (R? = 0.9807); SPT16-SSRP1AC (orange) 

201.0 + 50.6 (R?= 0.8266); and ACAC (black) 28.15 +1.9 (R?= 0.9843). Kyand 
standard deviations were derived from four biological replicates. 


DNA and histones (Fig. 2b-d). A decrease in deuterium is also observed 
at the interface between the SSRP1 middle domain and both dimeriza- 
tion domains (the hinge region) (Fig. 2b), suggesting decreased domain 
flexibility upon subnucleosome binding. These solution-based results 
are entirely consistent with the structures, and almost no changes in 
deuterium uptake are observed in regions outside those present in 
the structure. 

Despite the extensive DNA interface, phosphorylated FACT purified 
from insect cells is unable to bind free DNA (Fig. 2e). We hypothesized 
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Fig. 3 | SPT16 interactions with the subnucleosome. a, All visible domains of 
SPT16 interact extensively with the subnucleosome. The SPT16 CTD occupies 
DNA-binding surface of H2A-H2B. b, Volcano plot comparing the average 
AHDX of H2A-H2B with FACT and H2A-H2B with FACT and tetrasomes after 
100s exposure to deuterium. Data were collected in triplicate (n= 3); Welch’s 
t-test was one-sided. Dotted lines show significance cut-offs of A average 
HDX > 0.3 Daand P< 0.01 froma Welch’s t-test. Peptides from H2A are black, 
and from H2Bare grey. Notable peptides are listed. c, Regions of significant A 
average HDX (from b) mapped onto H2A-H2B from complex 1. Parts of the 


that the disordered CTDs of one or both FACT subunits block DNA 
binding by preventing access to the concave side of the saddle, 
and that this autoinhibition could be relieved by H2A—H2B interac- 
tion, which is known to occur through either CTD”. To test this, we 
performed DNA-binding assays with FACT C-terminal deletion 
constructs (Fig. 2e and Extended Data Fig. 6a). Deletion of either 
the SPT16 CTD or the SSRP1 CTD alone had no effect, but dele- 
tion of both simultaneously enabled FACT to shift free DNA ona 
native gel. DNA interaction was also facilitated (to a lesser degree) 
when full-length FACT was prebound to an H2A-H2B dimer (Extended 
Data Fig. 6b). It has previously been found that FACT does not 
produce a gel shift with tetrasome’. To test whether this is also 
due to autoinhibition by FACT CTDs, we used fluorescence 
polarization binding assays. As was the case for DNA binding, dele- 
tion of both CTDs resulted in an enhanced affinity of FACT for the 
tetrasome (Fig. 2f). 


SPT16 C-terminal tail shields H2A-H2B 


The acidic SPT16 CTD (pI of 4.3) wraps around the DNA-binding surface 
of H2A-H2B (Figs. Ic, 3a). The precise SPT16 residues involved cannot 
be identified from the density or the HDX owing to the low coverage 
in that region (Extended Data Fig. 4a), but we observe a slight yet sig- 
nificant protection of residues 1016-1030 (P< 0.01) (Extended Data 
Fig. 4c). H2A—H2B interfaces can be identified by comparing the HDX 
of H2A-H2B that is bound by FACT, with H2A-H2B in the FACT-bound 
subnucleosome (Fig. 3b and Extended Data Fig. 7). Direct comparison 
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docking domain are shown in space-filling mode. For H2A-H2B, regions withan 
HDX change are in yellow (H2A) and red (H2B); regions with no peptide 
coverage are in grey. FACT and tetrasome are shown in wheat. d, Comparison of 
the SPT16 MD in complex 1, with the SPT16 MD bound to (H3-H4), 

(RCSB Protein Data Bank (PDB) code 4Z2M). Only DNA and the SPT16 MD from 
complex 1are shown; the two structures were aligned on the basis of the 
(H3-H4), tetramer (root mean square deviation (r.m.s.d.) < 0.1 A). represents 
the dyad (the central base pair) of the nucleosomal DNA. 


with free H2A-H2B is convoluted by the global stabilization of histone 
folds upon protein interaction”. Volcano plot analysis shows a signifi- 
cant decrease in deuterium uptake (AHDX is more than 0.3 Da; P< 0.01) 
in H2A-H2B regions involved in nucleosome formation. In H2B the 
decrease is localized to its DNA-binding L2 loop and interface with 
H4, whereas in H2A it is localized to the docking domain that binds 
H3 aN and the SPT16 middle domain (see below) and to the L2 loop 
(Fig. 3c). Notably, H2A L2 binds the SPT16 CTD rather than DNA, sup- 
porting our interpretation that the SPT16 CTD acts the part of DNA” and 
might counteract the ‘foreign DNA invasion’ observed in transcription 
through nucleosomes®””. 


SPT16 has two binding modes on (H3-H4), 


A crystal structure of the SPT16 middle domain with the (H3-H4), 
tetramer reveals two regions of contact, neither of which is possible 
in the context of the subnucleosome”™. Our structures confirm the 
prediction” that one region of interaction is occluded by H3 aN and 
the H2A-docking domain, and the other is occupied by nucleosomal 
DNA. Consequently, the SPT16 middle domain (and with it the entire 
FACT complex) moves and rotates substantially by roughly 31 A and 
23° when the substrate transitions from free (H3-H4), to tetrasome 
(Fig. 3d). This frees up space for the H2A-docking domain, while the 
SPT16 middle domain binds to the nearby DNA (interface 1) (Extended 
Data Fig. 8). It also displaces a segment of FACT that in the absence 
of DNA covers the DNA-binding L1 loop of histone H4. Instead, this 
FACT region is now near the H4 N-terminal tail and H3 al (interface 
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Fig. 4| SSRP1can assume two different positions onthe subnucleosome, 
depending onthe presence or absence of asecond H2A-H2B dimer. a, The 
domain structure of SSRP1. b, Comparison of binding interfaces between 
SSRP1land the sunucleosome in complex 1 (top) and complex 2 (bottom). 
Interfaces are enlarged ininsets. The molecular surface of SSRP1is shown by 


2) (Extended Data Fig. 8). Overall, this massive rearrangement allows 
SPT16 middle domain to make interactions with nucleosomal DNA 
while also forging new interactions with histones—thereby stabiliz- 
ing asubnucleosomal particle. Of note, the previously demonstrated 
interaction between the SPT16 middle domain and a fused H2A—-H2B 
is not observed””°. 


SSRP1I can exclude or hold a second H2A-H2B 


The most substantial difference between our two structures is on the 
SSRP1 side of the complex: they differ in the absence (complex 1) or 
presence (complex 2) of asecond H2A-H2B dimer, probably repre- 
senting two stages of the assembly in equilibrium (Figs. lb-d, 4 and 
Extended Data Fig. 9). In both structures, the SSRP1 middle domain 
(consisting of two pleckstrin homology (PH) domains”) interacts with 
both dimerization domains, and the PH1 domain makes numerous 
contacts with the DNA (interface 1 in Fig. 4b). Although the relative 
orientation of PH1 with respect to the DNA is slightly different in the 
two structures, the same structural elements are likely to be engaged 
with DNA (Fig. 4a, b; interfaces 1 and 1’). In complex 1, the SSRP1 PH2 
domain is positioned across the (H3-H4), four-helix bundle and ontop 
of H3 «2, thereby precluding the placement of the second H2A-H2B 
and H3 aN in its rightful position in the nucleosome (interface 2 in 
Fig. 4b). In complex 2,a second H2A-H2B has successfully docked onto 
the histone core, facilitated by interactions with the PH2 domain of 
SSRP1 middle domain (interface 2’ in Fig. 4b). H3 aN and an additional 
5 bp of DNA (which binds the L1L2 interface of the second H2A-H2B 
dimer) are clearly visible in the electron-density map for complex 2, 
but are absent in complex 1. The relocation of the SSRP1 PH2 domain on 
the tetrasome surface by 10.9 A and 9.5° (upon addition of the second 
H2A-H2B dimer) (Fig. 4c) is partially transmitted to the dimerization 
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“_, Additional 5 bp DNA visible 


the main-chain trace.c, The SSRP1MD and DDin complexes 1 and 2 undergo 
substantial rearrangement on the subnucleosome surface when 
superimposing the two complexes via the (H3-H4), tetramer. The SSRPIMDin 
complex 1 would clash with the H2A-docking domain. 


domains, and also results in a subtle movement of the SPT16 middle 
domain (Supplementary Video 2). 


Implications for FACT mechanism 


The structures reported here and previously’, together with HDX data, 
highlighta plasticity of FACT interactions (Supplementary Videos2and3) 
that is likely to be required for its diverse functions. In the absence of 
DNA, FACT maintains the tetrameric conformation of (H3-H4), dur- 
ing early steps of nucleosome (re)assembly. In this mode, FACT might 
also bind H2A-H2B dimers through the SPT16 CTD. DNA then binds 
the tetramer and displaces FACT, which assumes its position ontop of 
nucleosomal DNA while forging new contacts with H3-H4. This pro- 
motes incorporation of the tethered H2A-H2B by freeing up H3-H4 
to engage its H2A-docking domain. FACT maintains interactions with 
the H2A-H2B DNA-binding surface through the SPT16 CTD, thereby 
stabilizing the subnucleosome. A similar interaction between the SPT16 
CTD and H2A-H2B has recently been observed, although no other FACT 
domains were visible in that structure”. 

Our structures also suggest a potential mechanism for FACT-medi- 
ated nucleosome disassembly” through a reversal of these steps. 
When the DNA at superhelix location —1 is peeled off from H3-H4 by 
DNA or RNA polymerase’, the SPT16 middle domain protects the newly 
exposed sites on the histones on the proximal side of the nucleosome, 
allowing FACT to contact the H3-H4 four-helix bundle, and resulting in 
the displacement of the H2A-docking domain (Supplementary Video 3) 
to facilitate the passage of polymerase. Similarly, the SSRP1 middle 
domain can also switch between two binding modes to accommodate 
either addition or destabilization of H2A-H2B on the distal side of the 
nucleosome. This mechanism is consistent with observations of his- 
tone occupancy in an Sptl6 or Ssrp1 knockdown in Drosophila cells”’. 


Both processes might be facilitated by the HMG domain of SSRP1 
and other domains not visible in our structure, and this is the topic of 
ongoing investigations. 
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Methods 


Expression and purification of FACT and mutant FACT 

We infected Sf9 cells with Autographa californica multiple nucleo- 
polyhedrovirus (ACMNPV) to express either wild-type human FACT or 
FACT with C-terminal deletions (SPT16AC(A934-1,047)-SSRP1, SPT16- 
SSRP1AC(A625-709) or SPTI6AC(A934-1,047)-SSRP1AC(A625-709)). 
We purified proteins as previously described’, with minor modifica- 
tions. We fused a x6 histidine (His) tag to the N terminus of SPT16, and 
purified FACT over a 5-ml prepacked HisTrap HP column, followed by 
a5-ml prepacked HiTrap Q HP column. The last step was a Superdex 
200 10/300 size-exclusion column run in 200 mM NaCl, 20 mM Tris- 
Cl (pH 8.0), 0.01% CHAPS, 0.01% octylglucoside (OG), 5% glycerol and 
1mMtris(2-carboxyethyl)phosphine (TCEP). Columns were purchased 
from GE Healthcare. 


Complex formation 

We expressed and purified recombinant human histones as previously 
described”. We premixed 1 1M FACT with equimolar amounts of refolded 
H2A-H2B for 10 min at room temperature, and then added an equimolar 
amount of (H3-H4), tetrasome reconstituted onto 79 bp 601 DNA” in 
20 mM Tris-Cl (pH 8.0), 150 mM NaCl, 1 mM EDTA and1mM TCEP™. After 
a30-min incubation, the complex was visualized with 5% native PAGE. 


Binding assays 

We mixed 400 nM FACT (wild type or deletion constructs) with 100 nM 
of Atto-647-labelled 30 bp DNA in 20 mM Tris-Cl (pH 8.0), 100 mM 
NaCl, 1mM EDTA and 1mM TCEP. After incubation for 10 minin room 
temperature, DNA shifting was analysed by 5% native PAGE. 

To measure FACT-tetrasome interactions, we mixed varying amounts 
of FACT (up to 2,500 nM) with 20 nM Alexa-488-labelled H3-H4 tetra- 
some in 20 mM Tris-Cl (pH 8.0), 0.01% CHAPS, 0.01% NP40, 100 mM 
NaCl, 1 mM EDTA and 1 mM TCEP. The reaction was incubated in a 
384-well microplate for 10 min at room temperature and fluorescence 
polarization data (obtained on a CLARIOstar microplate reader from 
BMG Labtech) were analysed by Graphpad Prism. Four replicates were 
done. Absolute K, values obtained with labelled tetrasome have to be 
taken with caution because of known tetrasome heterogeneity between 
batches. Nevertheless, comparative values obtained with the same batch 
are reproducible. Original datacan be found in Supplementary Table 1. 


Single-particle cryo-EM sample vitrification 

Reconstituted complexes were concentrated to 7-10 iM using an Ami- 
con Ultra-4 centrifugal filter (Ultracel 50K, Millipore). Fresh detergent 
(CHAPS, 0.25-0.5%) was added before grid preparation. To minimize 
sample interaction with the air—water interface and to keep the com- 
plex intact”, we vitrified the complex using a commercial prototype 
(from TTP Labtech) of the Chameleon robot based on the Spotiton 
robot*”®”, The time between the sample being spotted onto the nanow- 
ire grid* and vitrification into liquid ethane was 133 ms. Grids were 
plasma-cleaned (Gatan Solarus) for 5 s using an H,/O, mixture before 
vitrification. The robot chamber was operated at room temperature 
and roughly 80% humidity in the sample and plunging area. 


Data acquisition 

We recorded images on a Titan Krios electron microscope (FEI) 
equipped with a caesium corrector and a K2 summit direct detector 
with a quantum energy filter (Gatan) at 1.0961 A per pixel in counting 
mode, using the Leginon software package”. Pixel size was calibrated 
in-house using a proteasome test sample. We used an energy-filter 
slit width of 30 eV during the collection, aligned automatically every 
hour using Leginon. The first data collection was performed using a 
dose of roughly 66.37 e' A? across 50 frames (200 ms per frame) at a 
dose rate of roughly 8.0 e pix 's”, using a set defocus range of -1.9 ym 
to -2.0 um. The second data collection was performed using a dose 


of roughly 63.61 e A across 50 frames (200 ms per frame) at a dose 
rate of around 7.6 e&’ pix 's”, using a set defocus range of -1.5 pm to 
-2.5 um. We used a 100-ppm-aperture objective. We recorded a total of 
8,318 micrographs over two separate data-collection sessions using an 
image beam shift data collection strategy”. 


Data processing 

Data from the two sessions were processed separately and combined 
towards the end of the processing pipeline. For the first dataset, movie 
frames were aligned using MotionCor2 (ref. **) with 5-by-5 patches 
and aB-factor of 100 using the Appion software package™*. We carried 
out micrograph contrast transfer function (CTF) estimation using 
CTFFind4 (ref. >). We used DoG picker”, a template-free particle- 
picking algorithm, to pick 744,552 particles (box size of 240, binned 
by 2) that were extracted in Relion 3.0 (refs. *” *’) and transferred into 
CryoSPARC 2 (ref. *°) for two-dimensional (2D) classification. We dis- 
carded 2D class averages that were clearly contaminated or showed no 
features. We thenrancryo-EM single-particle ab initio reconstruction 
and classification (CryoSPARC) using 2 classes iteratively for 5 rounds, 
keeping particles in the better class for the next iteration, producing 
a final stack of 17,369 particles. These particles were processed using 
nonuniform refinement to produce a map of around 6 A resolution. 
We used the Euler angles and shifts to re-extract the particlesin Relion 
3.0, followed by CTF refinement and Bayesian polishing”. The particles 
were brought back into CryoSPARC 2 for nonuniform refinement. We 
then used improved Euler angles and shifts to re-extract the particles 
in Relion 3.0 for two more rounds of Bayesian polishing. The final nonu- 
niform refinement of 17,369 particles (extracted using a 256 box size 
binned by a 1.28 to 200 box size) resulted ina map at 4.4 A resolution 
based on the Fourier shell correlation (FSC) 0.143 criterion*”’. 

For the second dataset, movie frames were aligned using patch 
motionin CryoSPARC 2. CTF was estimated ina patch manner and par- 
ticles were picked with the CryoSPARC 2 blob picker. A total of 522,050 
picked particles were then extracted (256 box size binned by 2) and 
2D classification was performed. Four rounds of iterative CryoSPARC 
ab initio using three classes were used to clean up the dataset—the 
particles in the best class after each round were carried forward to the 
next iteration. The remaining 8,399 particles were refined to around 
8 Aresolution using nonuniform refinement. The Euler angles and 
shifts were then used to re-extract the particles in Relion 3.0, followed 
by CTF refinement and Bayesian polishing”. The particles were brought 
back into CryoSPARC 2.0 for nonuniform refinement. Improved Euler 
angles and shifts were then used to re-extract the particles in Relion 
3.0 for two more rounds of Bayesian polishing. The final nonuniform 
refinement of 8,399 particles (extracted using 256 box size binned by 
1.28 to 200 box size) resulted in a map at 7.4 A resolution based onthe 
FSC 0.143 criterion*”**. 

We carried out heterogeneous refinement using four classes for these 
particles. We identified two distinct classes, differing by the presence or 
absence of one of the histone dimers. Class 1, with 10,317 particles and 
without the second histone H2A-H2B dimer, resulted ina map at 6.6A 
resolution using nonuniform refinement; class 2, with 4,212 particles 
and the second H2A-H2B dimer, resulted in a map at 7.3 A resolution 
using nonuniform refinement. 

We then combined both datasets in CryoSPARC 2.0 and carried out 
athree-class ab initio run to sort out heterogeneity in the dataset. We 
identified two distinct classes, differing by the presence or absence of 
one of the histone dimers. Class 1, with 16,317 particles and without the 
second histone H2A-H2B dimer, resulted in a map at 4.9 A resolution 
using nonuniform refinement; class 2, with 6,990 particles and the 
second H2A-H2B dimer, resulted in a map at 7.4 A resolution using 
nonuniform refinement. We used the locally sharpened maps from 
CryoSPARC 2.0 for subsequent model building and analysis. The three- 
dimensional (3D) FSC and sphericity of the maps were calculated using 
the 3DFSC server“. 


All conversions between Relion and CryoSPARC were performed 
using D. Asarnow’s pyem script (personal communication; https:// 
github.com/asarnow/pyem). 


Model building and refinement 

The initial models were built by fitting published high-resolution crys- 
tal structures or homology models into the final 3D electron maps in 
University of California at San Francisco (UCSF) Chimera*. The crystal 
structures (with PDB identification code) used for initial modelling are: 
3LZO (‘601’ Widom DNA), 3AFA (human histone core), 4Z2M (SPT16 
middle domain) and 4IFS (SSRP1 middle domain). The initial model 
for dimerization domains was a combination of a crystal structure 
(SUMR, SSRPI; amino acids 1-100) and a homology model based on 
the crystal structure of the Chaetomium thermophilum Spt16 Pob3N 
heterodimer (4KHB) using SWISS-MODEL“. The initial models were 
then manually edited using COOT*. We used Poly-UNK to fit the visible 
density without assigning amino acids. We refined the models using 
real-space refinement in PHENIX*®. 


HDX-MS 
We prepared 7 1M H2A-H2B, FACT, FACT-H2A-H2B and FACT-subnu- 
cleosome in 20 mM Tris-HCl, 150 mM NaCl, 1mM TCEP, pH 7.5 at 25 °C. 
Samples were diluted 1/6 with the same buffer containing H,O for con- 
trols (pH,eaq= 7.5 at 25 °C) or DO (final D,O concentration of 83% (v/v) 
with pH,eaq= 7-1 at 25 °C). After 10 s, 10*s, 10° s and 10*s, samples were 
mixed 1/1 with 1% formic acid, 3.84 M guanidinium chloride, pH1.75 to 
give a final pH of 2.3 at 0 °C, and flash-frozen in liquid nitrogen for stor- 
age at -80 °C. Samples were thawed and liquid chromatography-mass 
spectrometry (LC-MS) performed using a Waters HDX manger and 
SYNAPT G2-Si Q-Tof. Three technical replicates of each sample were 
analysed in a random order. Samples were digested on-line using Sus 
scrofa pepsin A (Waters Enzymate BEH) at 15 °C, and peptides were 
trapped onaC4 pre-column (Waters Acquity UPLC Protein BEH C4) for 
3 min at 100 pl min™ and 1 °C. The liquid-chromatography buffer was 
0.1% formic acid. Peptides were separated over a C18 column (Waters 
Acquity UPLC BEH) and eluted with a linear 3-40% (v/v) acetonitrile 
gradient for 7 minat 40 pl min “and 1 °C. Raw data from control samples 
were processed by PLGS (Waters Protein Lynx Global Server 3.0.2) using 
a database containing Sus scrofa pepsin A and Homo sapiens SSRP1, 
SPT16, H2A and H2B. In PLGS, the minimum fragmention matches per 
peptide was 3, and methionine oxidation and serine/threonine/tyrosine 
phosphorylation were allowed. The low and elevated energy thresholds 
were 250 and 50 counts, respectively, and the overall intensity threshold 
was 750 counts. Labelled data were analysed in DynamxX 3.0 with 0.3 
products per amino acid and one consecutive product. 
Allmass-spectrometry data were acquired using positive-ion mode 
in either high-definition mass spectrometry (HDMS) or HDMS‘ mode, 
the latter being used to collect both low-energy (6 V) and high-energy 
(ramping 22-44 V) peptide-fragmentation data for peptide identi- 
fication. HDMS mode was used to collect low-energy ion data for all 
deuterated samples. All samples were acquired in resolution mode. 
Capillary voltage was set to 2.8 kV for the sample sprayer. Desolvation 
gas was set to 6501 hat 175 °C. The source temperature was set to 
80 °C. Cone and nebulizer gas was flowed at 90 1h ‘and 6.5 bar, respec- 
tively. The sampling cone and source offset were both set to 30 V. Data 
were acquired at a scan time of 0.4 s with a m/z range of 100-2,000. 
Mass correction was done using [Glu1]-fibrinopeptide B as areference 
mass. We include as Supplementary Information a summary of our HDX 
data (Supplementary Table 2) as well as the HDX data (Supplementary 
Table 3), as per consensus guidelines”. 


Sample size and randomization 


No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were 


not blinded to allocation during experiments and outcome 
assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Raw HDX-MS data are available in Supplementary Tables 2 and 3 for 
Figs. 2b-d, 3b, c. Atomic coordinates and cryo-EM maps have been 
deposited in the PDB and Electron Microscopy Data Bank (EMDB) under 
accession codes EMDB-20840, PDB 6UPK and EMDB-20841, PDB 6UPL. 
Raw cryoEM images have been deposited in the Electron Microscopy 
Public Image Archive (EMPIAR-10333). 
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Extended Data Fig. 1| Analysis of electron-microscopic data. 

a, Representative micrograph of particles containing 79 bp of 601DNA 
prebound toa (H3-H4), tetramer (‘tetrasome’), with FACT prebound to 
H2A-HZ2B. Theinset is a lower-magnification view showing the holy substrate. 
The red X marks the location of this micrograph in the low-magnification map. 


7.4 A map 


The particles in the final picking are circled in green. This experiment was 
repeated with six independent grids in two Krios sessions, with similar results. 
b, Two-dimensional class averages generated from the dataset obtained from 
CryoSPARC 2.0. c, Flowchart showing the data-processing strategy. 
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Extended Data Fig. 2| Validation of electron-microscopic data. a, Local- b, Local-resolution map and FSC curve for class 2 (complex 2). Local resolution 


resolution map and FSC curve for class 1 (complex 1). Local resolution was was calculated asina. 
calculated in CryoSPARC 2.0 and 3DFSC was calculated using the 3DFSC server. 
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Extended Data Fig. 3| Electron density and models. a, Density mapsforclass1 density maps for FACT-subnucleosome complex 1, DNA, H2A-H2B, (H3-H4),, 
and 2 particles in three orthogonal orientations. The subunits and domains are SPT16 middle domain, SSRP1 MD and SPT16-SSRP1 dimerization domains. 
colour-coded as indicated. b, Representative views of models fit into electron- 
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Residues with significant AHDX between FACT and FACT+sub-nucleosomes (Fig. 2b): 
21-27, 486-500, 560-572, 619-620, 622-636, 807-815, 864-870, 901-915, 1016-1030. 
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Residues with significant AHDX between FACT and FACT+sub-nucleosomes (Fig. 2c): 
105-108, 135-139, 180-186, 201-215, 256-272, 343-345, 377-390, 406-407. 
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samples. Significant peptides (P< 0.01) (Fig. 2b) are coloured and listed at the bars of+2s.d. 
bottom of each panel. Secondary-structure elements from the structures of 


SPT16 res. 560-568 | SSRP1 res. 130-138 


T T T "T T "T T T 
84 10' 10? 10% 104 C 10' 10? 10% 104 
6 Time (s) ) Ga Time (s) 6 


104 SPT16 res. 628-639 


Deuterium Uptake (Da) 
B 
1 
Deuterium Uptake (Da) 


SSRP1 res. 180-186 


Deuterium Uptake (Da) 


T mr T T 
10' 10% 10° 104 
14 Time (s) 

| SPT16 res. 901-916 


Deuterium Uptake (Da) 
wo 
1 


T T T 
104 10' 102 10° 104 
38 Time (s) 
10 
4 SSRP1 res. 203-214 


Deuterium Uptake (Da) 
o 
ee 


z 
OQ 8 
2] 3 
Da arate a8 
i 402. 408 404 
10' 102 10° 10 a 
Time (s) 3 
a 
6 3° 
@ ,,| SPT16 res. 864-870 an) 
cs] 10' 102 10° 104 
$44 Time (s) 
a | 10 
= 3 = SSRP1 res. 256-272 
324 Qa e4 
$ aaa 2 
$4 aed 
8 a 
O-frer 7 T mr = 
10’ 102 103 104 ES ‘| 
Time (s) 2 24 
7 fal 
@ ,| SPT16 res. 807-815 Dope cerraste rer tra 
a6 10' 102 10° 104 
g 5 Time (s) 
a4 @ | SSRP1 res. 344-352 
E 3 Q 
22 2 
5 
31 3 
mS r E 
1 402. 408 404 3 
ee oe Ss SSRP1 res. 377-392 i 
Time (s) Qa 124 2 
@ 104 a 
& af 10' 102 10° 104 
= 
e 64 ——™ Time (s) 
E 
3 4] — FACT+sub-nucleosomes 
5S 24 
a 0 — FACT 
trent 
10’ 102 10° 104 


Time (s) 
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Extended Data Fig. 6 | Effects of FACT deletion mutants on DNA binding. 

a, Full-length (FL) FACT and FACT CTD deletions (SPT16AC-SSRP1, SPT16- 
SSRPIAC or SPT16AC-SSRPIAC) were purified from Sf9 cells and visualized on 
4-12% SDS-PAGE, stained with Blazin blue. This particular gel was run only 
once, although proteins were purified and analysed by SDS-PAGE four times. 
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b, As for Fig. 2e, except that full-length FACT was pre-incubated with an 
equimolar amount of H2A-H2B dimer at 400 nM. The pre-incubation of wild- 
type FACT with H2A-H2B dimer has a similar effect on DNA binding as FACT 
ACTDs. Visualized by Atto N (647). The experiment was done twice with similar 
results. 
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a 
Residues with significant AHDX between H2A-H2B-FACT and H2A-H2B+FACT+tetrasome at 100 s (Fig. 3b): 
64-87, 94-113. 
b H2B Coverage Map 
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Residues with significant AHDX between FACT and FACT+sub-nucleosomes at 100 s (Fig. 3b): 
71-93. Note residues 96-113 shows a change at 21000 s only. 
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structure elements are shown. c, Deuterium-uptake plots forexamplepeptides peptide coverage are in grey. 
that showa significant change (P< 0.01) (Fig. 3c) when comparing H2A-H2B 
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Extended Data Fig. 8 | Interactions between H3-H4 and the SPT16 middle 
domain are incompatible with FACT interactions made with the 
subnucleosome. a, Domain structure of SPT16, and cartoon showing the 
overall architecture. b, Interactions between H3-H4 and the SPT16 MD 
(PDB code 4Z2M) are incompatible with FACT interactions made with the 
subnucleosome. These structures are for complex 1, which shows only the 
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SPT16 MD. Twosites of interaction with the subnucleosome are outlined 
(interface 1, near the H2A-docking domain and nearby DNA; interface 2, near 
the H4.N-tail and DNA).c, The presence of DNA and the H2A-H2B dimer 
completely occludes interactions between the SPT16 MD and the (H3-H4), 
tetramer in the 4Z2M structure (clashes 1and 2, indicated by red arrows). 
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Extended Data Table 1| Summary of cryoEM data collection and refinement 


Cryo-EM data collection, refinement and validation statistics 


Class | Class 2 
(EMDB-20840) (EMDB-20841) 
(PDB 6UPK) (PDB 6UPL) 
Data collection and processing 
Magnification 105,000 
Voltage (kV) 300 
Electron exposure (e—/A’) 70 
Defocus range (um) 1.5-2.5 
Pixel size (A) 1.0961 
Symmetry imposed Cl Cl 
Initial particle images (no.) 162,798 
Final particle images (no.) 16,317 6990 
Map resolution (A) 4.9 74 
FSC threshold 0.143 
Map resolution range (A) 49-123 6.5 — 20.0 
Directional resolution range (A) 4.46.7 6.2-9.4 
Sphericity of 3DFSC 0.95 0.87 
Refinement 
Initial model used (PDB code) 3LZ0,3AFA,4Z2M,4IFS 
Model resolution (A) 5.25 73 
FSC threshold 0.143 0.143 
Model resolution range (A) 4.8-7.1 6.6-7.9 
Map sharpening B factor (A’*) 151.0 382.7 
Model composition 
Non-hydrogen atoms 13861 15882 
Protein residues 1368 1589 
Nucleotide 144 158 
B factors (A’)* 
Protein 524 408 
Nucleotide 470 4ll 
R.m.s. deviations 
Bond lengths (A) 0.006 0.005 
Bond angles (°) 1.101 1.090 
Validation 
MolProbity score 2.26 2.25 
Clashscore 20.32 21.65 
Poor rotamers (%) 0.00 0.23 
Ramachandran plot 
Favored (%) 92.85 93.55 
Allowed (%) 715 6.32 
Disallowed (%) 0.00 0.13 


To obtain the best fit between model and map, we applied real-space refinement of atomic displacement parameters (ADPs, or 
B-factors) in PHENIX. The B-factors increase substantially during ADP real-space refinement. These numbers do not represent the 
same physical 

meaning of B-factors as in crystallography®°. 
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Software and code 


Policy information about availability of computer code 


Data collection CryoEM data was collected by Leginon software package. HDX data was collected by HDMS and HDMSE modes in MassLynx v4.1. 
Florescence polarization assay data was collected by CLARIOstar. 


Data analysis CryoEM data analysis was performed by MotionCor2 (1.2.1), CTFFind4 (4.1), DoG picker (1), Appion (3.2), Relion 3.0, cryoSPARC (2), 
Daniel Asarnow's pyem script (unpublished, https://github.com/asarnow/pyem), UCSF Chimera 1.11.2, PHENIX 1.12-2829, Coot 0.8.9 and 
SWISS-MODEL. Electrostatic surfaces were calculated with Pymol 2.3.2 (Apbs Tools 2.1). HDX data analysis was carried out by Waters 
Protein Lynx Global SERVER 3.0.2 and Dynamx 3.0. Florescence polarization data was analyzed by Graphpad Prism 7. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


HDX-MS raw data is available in SI tables 2 and 3. Atomic coordinates and cryo-EM maps have been deposited in the Protein Data Bank (PDB) and Electron 
Microscopy Data Bank (EMDB) under accession codes xxxx and xxxx, according to policies. 
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Data exclusions | Low quality data from cryoEM was excluded in Relion and cryoSPARC to reach high resolution with criteria pre-established in Relion and 
cryoSPARC. No data was excluded in both HDX and binding assays. 


Replication All experiments were performed with multiple biological replicates, indicated in figure legends. All of them are reproducible. 
Randomization _ This is not relevant in this study because protein structural data, HDX data and binding assays (EMSA and FP) could not be randomized. 


Blinding The investigators were not blinded on this study. No bias was introduced during data collection and data analysis in any assay in this study. 
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Class B G-protein-coupled receptors are major targets for the treatment of chronic 
diseases, including diabetes and obesity’. Structures of active receptors reveal 
peptide agonists engage deep within the receptor core, leading to an outward 
movement of extracellular loop 3 and the tops of transmembrane helices 6 and 7, an 
inward movement of transmembrane helix 1, reorganization of extracellular loop 2 
and outward movement of the intracellular side of transmembrane helix 6, resulting 
in G-protein interaction and activation’ ©. Here we solved the structure of anon- 
peptide agonist, TT-OAD2, bound to the glucagon-like peptide-1 (GLP-1) receptor. Our 
structure identified an unpredicted non-peptide agonist-binding pocket in which 
reorganization of extracellular loop 3 and transmembrane helices 6 and 7 manifests 
independently of direct ligand interaction within the deep transmembrane domain 
pocket. TT-OAD2 exhibits biased agonism, and kinetics of G-protein activation and 
signalling that are distinct from peptide agonists. Within the structure, TT-OAD2 
protrudes beyond the receptor core to interact with the lipid or detergent, providing 
an explanation for the distinct activation kinetics that may contribute to the clinical 
efficacy of this compound series. This work alters our understanding of the events 
that drive the activation of class B receptors. 


Class B peptide G-protein-coupled receptors (GPCRs) regulate the 
control of glucose and energy homeostasis, bone turnover, and car- 
diovascular development and tone’. Several peptide agonists are 
clinically approved for disorders of energy and bone metabolism’; 
however, attempts to develop non-peptide, orally available analogues 
have yielded only limited success. Understanding the structural basis 
of class B GPCR activation is crucial to the rational development of 
peptidic and non-peptidic drugs. Recent structural determination 
of full-length, active class B receptors bound to peptide agonists? ® 
confirmed that the N terminus of the peptide ligands, required for 
receptor activation, binds deep within the seven-transmembrane 
helical bundle. This is associated with an outward movement of the 
tops of transmembrane helices (TM) 6 and 7 (and interconnecting 
extracellular loop (ECL) 3) and a large kink in the centre of TM6 that 
opens up the intracellular face of the receptor to allow G-protein 
coupling” *””°. In parallel, a conformational reorganization of ECL2 
and an inward movement of TMI facilitates peptide interaction and 
receptor activation. 

The GLP-1 receptor (GLP-1R) is an established therapeutic target for 
type 2 diabetes and obesity”. Despite their clinical success, GLP-1R 
peptide drugs are suboptimal owing to their route of administration 


and side-effect profiles, most notably nausea and vomiting that reduce 
patient compliance". For many years, oral GLP-1R agonists have been 
pursued, with recent studies reporting promising clinical trial data for 
oral semaglutide—a new formulation of the approved peptide sema- 
glutide”?. However, it induced slightly greater severity of nausea and 
gastrointestinal side effects than those observed with injectable GLP-1 
mimetics. Future development of non-peptide drugs could offer more 
traditional small molecule absorption characteristics that may assure 
better long-term patient compliance with the potential for reduced gas- 
trointestinal liability, especially for patients who are co-administering 
with other medications. 

Several non-peptidic GLP-1R agonists have been identified". One 
class form covalent interactions with C347°* (in which the superscript 
denotes the Wootten class B GPCR numbering) and are predicted to 
allosterically disrupt polar networks at the base of the receptor, pro- 
moting activation’, whereas other small molecule compounds bind 
to unknown sites at the receptor extracellular face”. However, it is 
assumed that these molecules may need to mimic key interactions of 
the peptide N terminus deep within the transmembrane core to initiate 
receptor activation, as is seen for short stabilized 11-mer peptides, that 
occupy an overlapping site to full-length peptides’. 


'Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia. ?School of Biological Sciences, University of Essex, Colchester, UK. Lilly 
Research Laboratories, Eli Lilly and Company, Indianapolis, IN, USA. “Graduate School of Pharmaceutical Sciences, Tohoku University, Sendai, Miyagi, Japan. °The National Center for Drug 
Screening and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China. °School of Pharmacy, Fudan University, 
Shanghai, China. Mayo Clinic, Scottsdale, AZ, USA. °Graduate School of Medicine, University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan. °These authors contributed equally: Peishen Zhao, 
Yi-Lynn Liang, Matthew J. Belousoff, Giuseppe Deganutti. *e-mail: rado@m.u-tokyo.ac.jp; patrick.sexton@monash.edu; denise.wootten@monash.edu 
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Fig. 1| Pharmacology exhibited by TT-OAD2 relative to GLP-1. a, Chemical 
structure of TT-OAD2. b, Plasma insulin induced by GLP-1(10 pg kg"), TT-OAD2 
(3mgkg”) or gastric inhibitory polypeptide (GIP; 25g kg”) inanacuteIVGTT 
on humanized GLP-1R knock-in (KI) and GLP-1R knockout (KO) mice. c, Whole- 
cell binding assays showing the ability of GLP-1and TT-OAD2 to displace 
25].exendin(9-39). d, cAMP accumulation, intracellular calcium mobilization, 
B-arrestin-1 recruitment and ERK1/2 phosphorylation (pERK1/2). e, Agonist- 
induced changes in trimeric G, conformation in cell plasma membrane 
preparations for GLP-1 (left) and TT-OAD2 (middle). Rates (top right) and 
plateau (bottom right) at saturating concentrations (14M GLP-1, 10 pM TT- 
OAD2) were quantified by applying a one-phase association curve. f, Kinetics 
of cAMP production measured by an EPAC biosensor for GLP-1 (left) and 
TT-OAD2 (middle). Rates were quantified using approximate EC,, and E,,,, 
concentrations (1nM and 0.1 1M for GLP-1, 0.1 1M and 10 uM for TT-OAD2) by 
applying a one-phase association curve. Ine and f, arrows refer to the time at 
which ligand or vehicle was added. Parameters derived from kinetic data are 
represented as scatter plots with each individual experiment shown by black 
circles. Allexperiments were performed in GLP-1R expressing HEK293A cells. 
Datainbare mean+s.e.m. from 4-5 mice per treatment, representative of 3 
independent experiments. Datainc-fare mean +s.e.m. of 4-5 independent 
experiments (in duplicate or triplicate). *P< 0.05, Student’s paired f-test. 


Here we investigate TT-OAD2 (Fig. 1a), anon-peptidic compound 
reported in the patent literature and part of the chemical series that 
contains the vTv Therapeutics investigational drug candidate, TTP273. 
TTP273,an orally administered GLP-1R agonist, successfully completed 


phase Ila efficacy trials for type 2 diabetes (ClinicalTrials.gov Identi- 
fier: NCTO2653599), in which it met its primary endpoint, reducing 
levels of glycated haemoglobin in patients with type 2 diabetes, with no 
reported cases of nausea”, suggesting a potential clinical advantage for 
compounds of this series. Little has been disclosed about the molecular 
properties of this compound series; however, recent progression of 
TTP273 has been hampered by unexpected complexity in identifying 
optimal dosing that may be linked to a lack of understanding of its 
mechanism of action. Assessment of acute in vivo activity in human- 
ized GLP-1R mice revealed that TT-OAD72 is insulinotropic and that this 
effect is dependent on the GLP-IR (Fig. 1b). 


TT-OAD72 is a biased agonist with slow kinetics 


In HEK293 cells that overexpress GLP-1R, TT-OAD2 only partially dis- 
placed the orthosteric probes *I-exendin(9-39) and ROX-exendin-4 
(Fig. 1c, Extended Data Fig. 1a), consistent with an allosteric mode of 
interaction’®. Although GLP-1R signals to several cellular pathways, TT- 
OAD2 activated only a subset of these responses; it was alow-potency 
partial agonist for cAMP accumulation, with only weak responses 
detected for mobilization of intracellular Ca”* and phosphorylation of 
ERK1/2 at very high concentrations (100 1M) (Fig. 1d) and no detectable 
recruitment of B-arrestin-1. These data are indicative of bias towards 
cAMP and away from these other pathways relative to endogenous 
GLP-1. There is considerable interest in exploiting biased agonism at 
GPCRs to maximize the beneficial effects of receptor activation, while 
minimizing on-target side-effect profiles. 

CRISPR-engineered HEK293 cells in which G,,,, or G,,,.,, proteins were 
deleted revealed that G, was essential for the production of cAMP; 
however, this response, for both ligands, was also dependent on the 
presence of G,,.,, proteins. (Extended Data Fig. 1b). Assessment of proxi- 
mal activation of G, and G, transducers using split luciferase NanoBit 
G-protein sensors (Extended Data Fig. 1c) determined GLP-1-decreased 
luminescence in a bi-phasic, concentration-dependent, manner for 
bothG proteins with similar potencies in each phase. For TT-OAD2, the 
G, sensor gave a similar decrease in luminescence to GLP-1; however, 
enhanced luminescence was observed for the G, sensor, which suggests 
a different mechanism of G, activation. To probe these differences fur- 
ther, we used membrane-based assays of bioluminescence resonance 
energy transfer (BRET) G-protein sensors to assess the rate and nature 
of the G, conformational change. In contrast to the rates of change in 
the conformation of G,, which were similar for both ligands (Extended 
Data Fig. 1), there was a marked distinction in kinetics for G, coupling. 
GLP-1 promoted a rapid conformational change inG, protein, whereas 
for TT-OAD2 this was very slow (Fig. le). However, both agonists induced 
asimilar plateau of the measured response (Fig. le) that was reversed 
by excess GTP (Extended Data Fig. 1d), indicative of a similar overall 
conformational rearrangement. Together, this suggests that slower 
G, conformational transitions, required for the exchange of GDP for 
GTP andG, activation, would result in lower turnover of G protein and 
rate of cAMP production by TT-OAD2. Direct kinetic measurements 
of cAMP production validated this hypothesis (Fig. 1f, Extended Data 
Fig. le). Overall, these data revealed TT-OAD2 as a biased agonist that 
can only activate a subset of pathways with limited efficacy and with 
distinct activation kinetics relative to peptide agonists. 


TT-OADZ2 has an unexpected binding mode 


To understand how TT-OAD2 binds and activates the GLP-1R, we deter- 
mined the GLP-1R structure bound to TT-OAD2 and the transducer 
heterotrimeric G, protein (Fig. 2). Complex formation was initiated 
in Tniinsect cells by stimulation with 50 pM TT-OAD2, and complexes 
were then solubilized and purified (Extended Data Fig. 2a). Vitrified 
complexes were imaged by single-particle cryo-electron microscopy 
(cryo-EM) ona Titan Krios. Following 2D and 3D classification, the most 
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Fig. 2|TT-OAD2-GLP-1R-G, cryo-EM structure reveals non-peptide binding 
site. Top, orthogonal views of the TT-OAD2-GLP-1R-G, complex cryo-EM map 
(left) and the structure after refinement in the cryo-EM map (right), colour- 
coded to protein chains; GLP-1R (blue), TT-OAD2 (red), heterotrimeric G, 

(a: gold, B: dark cyan, y: purple, Nb35: salmon). Middle, TT-OAD2 interacts 
with the top of the GLP-1R bundle. Interacting residues of GLP-1R (blue) with 
TT-OAD2 (red). Bottom, TT-OAD2-mediated cAMP production by receptors 
containing alanine mutants of key residues assessed in ChoFlpIn cells. Dataare 
mean +s.e.m. of four independent experiments performed in duplicate. WT, 
wildtype. 


abundant class was resolved to 3.0 A (Extended Data Fig. 2c-f, Sup- 
plementary Table 1). The cryo-EM density map allowed unambiguous 
assignment of the TT-OAD2-binding site and pose, and clear rotamer 
placement for most amino acids within the receptor core and G protein 
(Fig. 2, Extended Data Figs. 3, 4a, b). The GLP-1R extracellular domain 
(ECD) and the Ga, a-helical domain were not resolved at high resolu- 
tion, consistent with their greater mobility. Rigid body fitting of an 
available X-ray structure of the GLP-IR ECD domain (PDB code3C5T)”” 
was performed into the density to generate a full-length model. 
TT-OAD2 bound high up in the helical bundle interacting with res- 
idues within TMI, TM2, TM3, ECL1 and ECL2 (Fig. 2, Extended Data 
Fig. 4a). Most interactions are hydrophobic in nature (Fig. 2), includ- 
ing numerous 1-1 stacking interactions between receptor aromatic 
residues and phenolic regions within the ligand. Unexpectedly, TT- 
OAD2 adopts a ‘boomerang-like’ orientation within the binding site 
with the 3,4-dichloro-benzyl ring of TT-OAD2 protruding beyond the 
receptor core through transmembrane helices 2 and 3, interacting 
with W2037”, and embedding in the detergent micelle, consistent with 
probable interactions with the lipid bilayer ina native system. F230** 
and W297‘ interact with the 2,3-dimethyl-pyridin-4-yl-phenol region, 
Y220*"' forms a hydrogen bond with the 2,3-dimethyl-pyridine ring 
and K197?” forms a polar interaction with the propionic acid part of 
the ligand. Additional hydrophobic contacts are formed with TT-OAD2 
by Y1451*°, L20174, 11967, A20079, L217®*"!, V229?? and M204?° 
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(Fig. 2, Extended Data Fig. 4a). Molecular dynamics simulations of the 
TT-OAD2-GLP-1R-G, complex predicted further transient interactions 
with TM1, TM2, TM3, ECL1, ECL2 and the ECD of GLP-1R (Extended 
Data Table 1). Assessment of TT-OAD2-induced cAMP production 
at alanine mutants of key receptor residues within the binding site 
revealed reduced potency (negative logarithm of the half-maximal 
effective concentration, pEC;)), reduced maximal responses (E,,,,) OF 
both relative to the wild-type receptor (Fig. 2, Supplementary Table 2). 
Application of the operational model of agonism revealed these muta- 
tions directly alter TT-OAD2 functional affinity (K,) and/or efficacy (1) 
(Supplementary Table 2), which highlights the importance of these 
residues in TT-OAD2 function. 


Peptide versus non-peptide binding sites 

The TT-OAD2-binding pose has very limited overlap with full-length 
peptides, GLP-1 and exendin-P5 (ExP5)** (Fig. 3, Extended Data Fig. 5). 
Structural comparisons, combined with associated molecular dynam- 
ics simulations performed on models generated from the cryo-EM data, 
identified only 10 out of 29 residues that interact with both TT-OAD2 
and GLP-1. Moreover, the persistence and nature of ligand interactions 
formed by common residues differed (Fig. 3c, Extended Data Table 1). 
In contrast to TT-OAD2, peptide ligands engage transmembrane heli- 
ces 5-7 in addition to extensive interactions deep within the bundle 
in transmembrane helices 1-3 (Fig. 3, Extended Data Fig. 5, Extended 
Data Table 1). 

The relatively limited overlap between the peptide- and TT-OAD2- 
binding sites suggests that this compound series may modulate pep- 
tide function in a physiological setting. To address this, we assessed 
the effect of TT-OAD2 on the signalling of two physiological ligands 
(Extended Data Fig. 6). TT-OAD2 inhibited GLP-1- and oxyntomodu- 
lin-mediated cAMP, calcium, pERK1/2 and B-arrestin responses ina 
concentration-dependent manner (Extended Data Fig. 6). This suggests 
that the profile of signalling observed from the GLP-1R when using 
TT-OAD2-like compounds as drugs may depend on the dose adminis- 
tered; at high concentrations, their presence would probably inhibit all 
endogenous peptide effects, biasing receptor responses primarily to 
cAMP formation mediated by the compound itself. However, at lower 
concentrations, some endogenous peptide signalling may still occur. 
Notably, TTP273 was reported to exhibit greater clinical efficacy at 
lower concentrations, indicating that maintenance of some aspects 
of physiological signalling may be important for clinical efficacy”. 


GLP-1R conformational changes and activation 
At agross level, the TT-OAD2-complexed GLP-1R helical bundle displays 
the key hallmarks of activated, peptide-occupied, class B GPCRs? °. 
At the extracellular face, this includes the large outward movement 
of TM6, ECL3 and TM7, inward movements of TM1, helical extensions 
within TM2 and TM3, areordering of ECL1, and conformational transi- 
tions within ECL2 that increases upward towards the extracellular side 
(Extended Data Fig. 5). At the intracellular side, there is an equivalent 
large outward movement of TM6 away from the centre of the helical 
bundle, and the smaller outward movement of TMS. It is important to 
note that the fully active state is driven in part by allosteric conforma- 
tional changes, including those in the extracellular face, linked to G 
protein binding”. Nonetheless, all the GLP-1R structures are solved with 
the same G protein yet reveal conformational differences at their extra- 
cellular face, including within the extent of movement of TM6, ECL3, 
ECL7 and the conformation of the ECD, TM2-ECL1 and ECL2 that are 
linked to the bound agonists (Fig. 3a, b, Extended Data Fig. 5b, c). This 
suggests that distinct receptor activation triggers converge to common 
changes at the intracellular face that allow coupling to transducers. 
Although the low resolution of the receptor ECD for the TT-OAD2 
complex indicates extensive mobility, it occupied a distinct orientation 
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Fig. 3| Comparisons of GLP-1R conformations induced by GLP-1and TT- 
OAD2. a,b, Superimposition of the GLP-1R from PDB 5VAI (GLP-1R or G protein: 
orange, GLP-1: green) andthe TT-OAD2 structure (GLP-1R or G protein: blue, TT- 
OAD2: red) reveals partial overlap of peptide- and TT-OAD2-binding sites and 
conformational differences inthe receptor. a, Left, full complex; middle, close 
up of ECD and the top of the seven-transmembrane (7-TM) bundle; right, close 
up of the transmembrane bundle. b, Left, 16 A, 7 Aand 6A differences occur in 
the location of TM6/ECL3, TM7 and TMI, respectively. Middle, a4Ashiftinthe 
location of the top of TM2 result in distinct conformations of ECL1. Right, the 
intracellular region of the GLP-1R helical bundles have similar overall backbone 
conformations. c, Comparison of the GLP-IR-T T-OAD2 and GLP-1R-GLP-1 
contacts during molecular dynamics simulations performed onthe GLP-1R-TT- 
OAD2-G, and GLP-1R-GLP-1-G, complexes. Top (left) and side (right) views of 
the GLP-1R transmembrane domain (ribbon representation, TT-OAD2 in red 
sticks, GLP-1 not shown). TT-OAD2 made contacts (red coloured ribbon) with 
ECL1and residues located at the top of TM2 and TM3. GLP-1 was able to engage 
TMS, TM6 and TM7 of the receptor and side chains located deep in the bundle 
(blue coloured ribbon). Residues that are involved both in the GLP-1R-TT- 
OAD2-G, and GLP-1R-GLP-1-G, complexes are indicated by asterisks, and 
coloured according to the algebraic difference in occupancy (contact 
differences in percentage frames) between GLP-1R-T T-OAD2-G, and GLP-1R- 
GLP-1-G,. Red indicates regions more engaged by TT-OAD2 and blue more 
engaged by GLP-1. The ECD is not shown. Plotted data are summarized in 
Extended Data Table 1. 


relative to the transmembrane core in comparison to peptide-bound 
complexes, whereas both GLP-1- and ExP5-bound receptors stabilized 
a similar conformation*® (Extended Data Fig. 5a). Similarly, the short 
11-mer peptide HepP5 forms few interactions with the ECD” and occu- 
piesa distinct orientation relative to GLP-1 and ExP5, but this conforma- 
tion also differs from that stabilized by TT-OAD2 (Extended Data Fig. Sc). 
The cryo-EM map of the TT-OAD2-bound receptor complex supports 
extended interactions of the ECD with ECL1 and ECL2 (Extended Data 
Fig. 4c) and this is supported by molecular dynamics simulations that 
predicts interactions of R40* with D215°“' and E34* with R299" 
(Extended Data Table 2). This later interaction is particularly important 
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Fig. 4| TT-OAD2 interactions lead to reorganization and stabilization of the 
central polar network viaa distinct mechanism to GLP-1. Summaries of 
interactions observed in molecular dynamics simulations (Supplementary 
Video 2) on TT-OAD2- and GLP-1-bound GLP-1R that predict interactions 
stabilizing the active conformation of the central polar network. a, Left, GLP-1 
(brownribbon) residue D? (brown stick) forms an ionic interaction (red dotted 
lines) with R1902°, which is involved in key hydrogen bonds with N24.0>*7 (in 
turn interacting with $1867°°). At the top of TM2, K197?”, D198? and Y145'*° 
are stabilized in polar interactions (red dotted lines). Right, TT-OAD2 (brown 
stick and transparent surface) forms ionic interaction (red dotted lines) with 
K197?-’ and hydrophobic contacts with Y145' and Y148" (cyan transparent 
surfaces) modifying the interaction network at the top of TMI. Y148'? 
transiently interacts with R190* and partially reorients N24.0>*7 and S1867°°. 
TM6and TM7 were removed for clarity. b, GLP-IR transmembrane helix sites 
are occupied by structural water molecules; blue spheres indicate receptor 
volumes occupied by low-mobility water molecules (occupancy more than 
75% frames). Left, the GLP-IR-GLP-1-G, complex stabilizes the central 
transmembrane polar residues by waters interacting with Y152'”, T391’*°, 
R1902° and E364*? (Supplementary Video 1). Right, the GLP-IR-TT-OAD2-G, 
complex is characterized by structural water molecules interacting with 
N320°*° and E364°*? (Supplementary Video 1). 


as R299"? directly, and stably interacts with peptide ligands, but in 
the TT-OAD2-bound receptor, stabilizes the N terminus of the ECD in 
a position that may have an analogous role to the peptide in stabilizing 
ECL2. Indeed, in our models, the position of the far N-terminal ECD 
helix overlapped with the location of the C-terminal region of GLP-1 
and ExP5 when comparing the TT-OAD2- and peptide-bound structures 
(Fig. 3a). Thus, the ECD is likely to be important for both stabilizing the 
TT-OAD2-binding site and facilitating receptor activation, as previously 
proposed for different classes of peptide ligands””’. 

Distinctions from peptide-bound receptors observed within TM2/ 
ECL1and ECL2 (Fig. 3b) are probably driven by direct ligand interactions 
by TT-OAD2 (Fig. 2), whereas those within TM6 and TM7 by direct inter- 
actions formed by peptide agonists. Molecular dynamics simulations 
also support a role of membrane lipid interactions in directly stabilizing 
both these regions within the TT-OAD2-bound structure (Extended Data 
Fig. 7). Notably, the helical bundle of the TT-OAD2-complexed receptor 
isinamore open conformation than the peptide-occupied receptors, 
largely owing to the top of TM6/ECL3, TM7 and TMI residing 16 A, 6A 
and 7A further outwards relative to the GLP-1-bound structure (meas- 
ured from the Ca atoms of D372°*“, F3817” and P137!, respectively 
(Fig. 3b). The orientation of TM6, ECL3 and TM7 also differs between 
ExP5- and GLP-1-bound structures, with ExP5 adopting a more open 
conformation’; however, the outward positioning of ECL3 induced 
by TT-OAD2 is much larger (Extended Data Fig. 5b). Peptide-bound 
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structures of all solved class B GPCRs revealed direct interactions of 
the engaged peptide with residues within TMS, TM6, TM7 and ECL3 
with the peptide volume (minimally) presumed to actively contribute 
to the outward conformational change in this region? **°, In the apo- 
state of the glucagon receptor, interactions occur between ECL3 and 
the ECD that contribute to maintenance of receptor quiescence’”***”®, 
Molecular dynamics simulations onthe GLP-1R structures, performed 
after the removal of either TT-OAD2 or GLP-1, predict that the GLP-IR 
ECDalso adopts both open and closed conformations in the apo-state, 
in which it can form transient interactions with both ECL2 and ECL3* 
(Extended Data Fig. 8). Combining this information with the GLP-1R 
active structures suggests that interactions, with either peptide or 
non-peptide agonists, can release ECL3-ECD constraints, lowering the 
energy barrier for receptor activation. However, the degree of ligand 
interaction with TM6-ECL3-TM7 determines the extent to which the 
transmembrane bundle opens, and this in turn directly contributes to 
G-protein efficacy and biased agonism, as these regions (TM6-ECL3- 
TM7 and TMI) have been identified as key drivers for these phenomena, 
particularly for the GLP-IR*”’*’. 

Despite the different binding modes, commonalities observed in 
interactions with TT-OAD2 and peptide with transmembrane helices 
1-3 and stabilization of ECL2 are sufficient to initiate conformational 
transitions that propagate to a similar reorganization of the class B 
GPCR conserved central polar network that is linked to activation, 
albeit the mechanism for this differs for peptide agonists versus TT- 
OAD2 (Fig. 4a, Supplementary Video 1, Extended Data Fig. 9). Molecular 
dynamics simulations of the GLP-1-bound GLP-1R predicted persistent 
interactions between Y152"*’, R1907°, Y241? and E364°*? and the 
N terminus of GLP-1 that directly engage the central polar network 
(Fig. 4a, Extended Data Tables 1, 2, Supplementary Video 1). By contrast, 
TT-OAD influences the central polar network allosterically via interac- 
tions with K197*, Y145"° and Y148'°. TT-OAD2 also promotes unique 
hydrogen bond networks with crucial residues in TM2 (Fig. 4a, Extended 
Data Table 2) that result in different interaction patterns at the top of 
TM1 and TM2 relative to peptide-occupied receptors. These effects 
propagate to the polar network through transient contacts between 
TT-OAD2 with Y148'* and Y152'’ that in turn interact with R190? of 
the central polar network (Supplementary Video 2). When bound by 
GLP-1, the polar network is stabilized by ligand and a network of water 
molecules, whereas for TT-OAD2, this occurs via a distinct network of 
structural waters rather than by the ligand (Fig. 4b, Supplementary 
Video 1). These differences in the mechanism of conformational tran- 
sitions and stabilization of conserved polar networks (summarized in 
Extended Data Fig. 9) may contribute to the different kinetic profiles 
of G-protein activation, as well as the full versus partial agonism for 
cAMP production. 

Collectively, our work provides key advances in understanding 
the activation of class B GPCRs and G, protein efficacy, identifying a 
non-peptide binding site within the GLP-1R that can promote distinct 
efficacy and biased signalling relative to peptide ligands, and this may 
extend to other class B GPCRs. The demonstration that non-peptide 
agonists of the GLP-1R are not required to mimic the extensive recep- 
tor contacts formed by peptides within the transmembrane cavity to 
promote receptor activation will advance the pursuit of non-peptide 
agonists for therapeutically important class B receptors. 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation. 


TT-OAD2 synthesis 

Several azoanthracene-based derivatives are reported as potent 
agonists of the GLP-1R (WO10114824), and a compound from this 
series known as OAD2 was selected for our studies (WO14113357). 
OAD2, (S)-2-{[(3S,8S)-3-[4-(3,4-dichloro-benzyloxy)-phenyl]-7-((S)-1- 
phenyl-propyl)-2,3,6,7,8,9-hexahydro-[1,4]dioxino[2,3-g]isoquinoline- 
8-carbonyl]-amino}-3-[4-(2,3-dimethyl-pyridin-4-yl)-phenyl]-propionic 
acid, was synthesized using procedures previously described (see exam- 
ple 179 in WO10114824), anda dihydrochloride salt form (OAD2.2HCI) 
was prepared by standard methods from the free base. Therefore, TT- 
OAD2 is the dihydrochloride salt of OAD2 in patent WO14113357. The 
purity of TT-OAD2 was determined by liquid chromatography-mass 
spectrometry (LC-MS) to be 98.62%. 


Constructs 

GLP-1R was modified to contain either a2xcMyc-N-terminal epitope tag 
(for signalling and radioligand-binding assays) or a Nanoluc tag (with 
a12xGly linker; for NanoBRET binding studies) after the native signal 
peptide. For B-arrestin recruitment assays, a C-terminal Rluc8 was fused 
tothe Cterminus of the receptor. For G-protein conformational assays, 
a Nanoluc flanked by SGGGGS linkers was inserted into Ga, and Ga;, 
after G(h1hal0) in Ga, or E(HA.03) in Ga, as previously described*>”. 
These were used in conjunction with an N-terminally Nluc-labelled Gy,. 
For G-protein steady-state assays, G-protein NanoBit-split luciferase 
constructs were generated by fusing the LgBIT after G(h1ha10) in Ga, 
or E(HA.29) in Ga,, and the SmBIT to Gy, For structural studies, human 
GLP-1R in the pFastBac vector was modified to include an N-terminal 
Flag tag epitope and a C-terminal 8xhistidine tag; both tags are remov- 
able by 3C protease cleavage. These modifications did not alter the 
pharmacology of the receptor®. A dominant-negative Ga, construct 
was generated previously by site directed mutagenesis to incorporate 
mutations that alter nucleotide handling, stabilize the G, state and 
interactions with the By subunits*’. 


Insect cell expression 

GLP-1R, human dominant-negative Ga,, His6-tagged human GB, and Gy, 
were expressed in 7niinsect cells (Expression systems) using baculovi- 
rus. Cell cultures were grown in ESF 921 serum-free media (Expression 
Systems) to a density of 4 million cells per ml and then infected with 
three separate baculoviruses at a ratio of 2:2:1 for GLP-1R, dominant- 
negative Ga, and GB,y,. Cells were obtained by centrifugation 60 h 
after infection and the cell pellet was stored at —80 °C. 


Purification of the TT-OAD2-GLP-1R-G, complex 

Cell pellet was thawed in 20 mM HEPES, pH 7.4, 50 mM NaCl, 2 mM 
MgCl, supplemented with cOmplete Protease Inhibitor Cocktail tablets 
(Roche). Complex formation was initiated by addition of 50 pM TT- 
OAD2, Nb35-His (10 pg mI) and apyrase (25 mU mI”, NEB) to catalyse 
hydrolysis of unbound GDP and allow for stabilization of the G, state; 
the suspension was incubated for 1h at room temperature. Membrane 
was solubilized by 0.5% (w/v) lauryl maltose neopentyl glycol (LMNG, 
Anatrace) supplemented with 0.3% (w/v) cholesteryl hemisuccinate 
(CHS, Anatrace) for 2 hat 4 °C. Insoluble material was removed by 
centrifugation at 30,000g for 30 min and the solubilized complex 
was immobilized by batch binding to M1 anti-Flag affinity resin in the 
presence of 3mM CaCl.. The resin was packed into a glass column and 
washed with 20 column volumes of 20 mM HEPES pH 7.4, 100 mM NaCl, 
2mMMgCl,,3mM CaCL,, 1M OAD, 0.01% (w/v) MNG and 0.006% (w/v) 


CHS before bound material was eluted in buffer containing 5 mM EGTA 
and 0.1mg mI‘ Flag peptide. The complex was then concentrated using 
an Amicon Ultra Centrifugal Filter (molecular mass cut off 100 kDa) 
and subjected to size-exclusion chromatography on a Superdex 200 
Increase 10/300 column (GE Healthcare) that was pre-equilibrated 
with 20 mM HEPES pH 7.4, 100 mM NaCl, 2mM MgCl, 111M OAD, 0.01% 
(w/v) MNG and 0.006% (w/v) CHS to separate complex from contami- 
nants. Eluted fractions consisting of receptor and G-protein complex 
were pooled and concentrated. Final yield of purified complex was 
approximately 0.15 mg per litre of insect cell culture. 


Electron microscopy 

Samples (3 pl) were applied to a glow-discharged Quantifoil R1.2/1.3 
CuRh 200 mesh holey carbon grid (Quantifoil GmbH) and were flash 
frozen in liquid ethane using the Vitrobot mark IV (Thermo Fisher 
Scientific) set at 100% humidity and 4 °C for the prep chamber. Data 
were collected ona Titan Krios microscope (Thermo Fisher Scientific) 
operated at an accelerating voltage of 300 kV with a50 um C2 aperture 
at an indicated magnification of 105 K in nanoprobe EFTEM mode. 
Gatan K3 direct electron detector positioned post a Gatan Quantum 
energy filter, operated in a zero-energy-loss mode with a slit width 
of 25 eV was used to acquire dose fractionated images of the GLP-IR 
TT-OAD2-bound sample without an objective aperture. Movies were 
recorded in hardware-binned mode (previously called counted mode 
onthe K2 camera) yielding a physical pixel size of 0.826 A pixel with 
an exposure time of 3.715 s amounting to a total dose of 65.6e A?ata 
dose rate of 12.2 €° pixels, which was fractionated into 62 subframes. 
Asecond dataset of 1,568 micrographs was also recorded using the 
same microscope but in ‘super-resolution’ mode on the K3 detector, 
the physical pixel size was 0.413 A with an exposure time of 4.015 s 
amounting to a total dose of 63.5e A 2, which was fractionated into 67 
subframes. Defocus range was set between —0.7 and -1.5 pm. A total of 
3,158 plus 1,568 movies were collected in two data collection sessions. 


Electron microscopy data processing 

Movies were motion-corrected with UCSF MotionCor2” (movies col- 
lected in super-resolution mode were Fourier scaled by a factor of x2 
to match the pixel size of the larger data set). This was followed by CTF 
estimation sing the GCTF software packag”. Particles were picked 
from the micrographs using the automated reference-free procedure 
in RELION**”>, Reference free 2D and 3D classification (by generating 
multiple ab initio models with no structural identity enforced) was 
carried out in CryoSPARC (v.2.5.0)**. A homogeneous subset of par- 
ticles was then subjected to movie refinement and Bayesian particle 
polishing as implemented in RELION (v.3.0). This homogeneous subset 
of polished particles was used in a 3D refinement in RELION and then 
further classified into 3D classes with alignment of Euler angles not 
taken into account. Particles belonging to the 3D class that yielded 
the best resolved map were then subjected to signal subtraction to 
subtract density due to the detergent micelle and the alpha domain 
of the G protein. Final 3D refinement was performed in RELION (3.0) 
yielded a map of resolutions 3.01A. Local resolution estimations were 
performed using the ResMAP software packag””. 


Atomic model refinement 

Fitting the model to the cryoEM electron density map was achieved 
using the MDFF routine in namd**. The fitted model was further refined 
by rounds of manual model building in coot® and real space refinement 
as implemented in the Phenix software package”, the model restraints 
for the TT ligand were prepared by using the coordinates generated 
from Chem3D and the ELBOW software package“. The ligands were 
fitted after the first round of real-space refinements, manually first in 
coot®, then refined using Phenix real-space refinement*. Ramachan- 
dran, rotamer and secondary structure restraints were applied for the 
first round of real-space refinement, and after manual inspection and 
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adjustment of the model in coot further real-space refinements were 
carried out with only Ramachandran and rotamer restraints applied 
and the model/data weight was allowed to freely refine. The density 
around the extracellular domain was poorly resolved (local resolution 
estimated at >8 A) and was not modelled. 


Modelling methods for preparation of molecular dynamic 
simulations 

The two missing receptor loops, namely the stalk region and ICL3, were 
generated using PLOP*; ICL3 was also minimized in the presence of 
Ga to eliminate steric clashes. On the basis of the electron density of 
our structures, TM] for the GLP-1-bound 5VAI structure® was replaced 
by TM1 from the P5-bound structure (PDB code 6B3J)? by the method 
of molecular superposition. The missing residues in the stalk region 
were reconstructed using Modeller** subject to the constraint that 
the high variability positions®* in the GLP-1R multiple sequence align- 
ment (E133-R134) faced outwards. The missing loops in the G protein 
were generated by molecular superposition, using VMD“~, of the cor- 
responding loops in the B,-adrenergic receptor-G protein complex”, 
PDB code 3SN6to the flank either side of the gap, since this particular 
X-ray structure (with 99% identity to the G protein used in this study) 
generally gave a lower root mean squared deviation value on molecu- 
lar superposition than plausible alternative G-protein structures (for 
example, PDB 5VAI). The joining point was taken as the closest atom 
pairs (usually separated by approximately 0.2 A) that maintained an 
appropriate Ca-Ca distance (3.7-3.9 A) across the join; selected resi- 
dues spanning the join were minimized using PLOP where additional 
refinement was deemed necessary. The exception to this was the loop 
between A249-N264, which was completed using the shorter loop from 
the adenosine A,, receptor-G-protein complex, PDB code 5G53**. The 
helical domain, between residues G47 and G207, whichis not visible in 
the cryo-EM structure, was omitted as in earlier work. 


Molecular dynamics methods 

Four GLP-1R complexes (GLP-1R-TT-OAD2-G,; GLP-IR-TT-OAD2; GLP- 
1R-GLP-1-G,; and GLP-IR-GLP-1; Supplementary Table 3) and two apo 
GLP-1R structures (obtained by removing both the G, protein and the 
ligands; Supplementary Table 3) were prepared for simulation with the 
CHARMM36 force field”, through use of in-house python htmd*? and 
TCL (Tool Command Language) scripts. The pdb2pqr™ and propka” 
software were used to add hydrogen atoms appropriate for a pH of 7.0; 
the protonation of titratable side chains was checked by visual inspec- 
tion. The coordinates were superimposed on the corresponding GLP- 
1R coordinates from the OPM database™ so as to orient the receptor 
before insertion®™ in a rectangular pre-built 125 A x 116 A 1-palmitoyl- 
2-oleyl-sn-glycerol-3-phosphocholine (POPC) bilayer; lipid molecules 
overlapping the receptor were removed. TIP3P water molecules were 
added to the 125 A x 116 Ax 195 A simulation box using the VMD Solvate 
plugin 1.5 (Solvate Plugin, v.1.5; http://www.ks.uiuc.edu/Research/vmd/ 
plugins/solvate/). Overall charge neutrality was maintained by adding 
Na‘ and CI counter ions toa final ionic concentration of 150 mM using 
the VMD Autoionize plugin 1.3 (Autoionize Plugin, v.1.3; http://www. 
ks.uiuc.edu/Research/vmd/plugins/autoionize/). CGenFF force field 
parameters* *’ and topology files for TT-OAD2 were retrieved from the 
Paramch® webserver. No further optimization was performed because 
the obtained parameters were associated to low penalty scores. 


Systems equilibration and molecular dynamics simulation 
settings 

ACEMD*® was used for both equilibration and molecular dynamics 
productive simulations. Isothermal-isobaric conditions (Langevin 
thermostat with a target temperature of 300 K and damping of 1 ps* 
and Berendsen barostat® with a target pressure 1 atm) were used to 
equilibrates the systems through a multi-stage procedure (integra- 
tion time step of 2 fs). Initial steric clashes between lipid atoms were 


reduced through 3,000 conjugate-gradient minimization steps, thena 
2ns molecular dynamics simulation was run witha positional constraint 
of 1kcal mol!A7on protein atoms and lipid phosphorus atoms. Subse- 
quently, 20 ns of molecular dynamics simulations were performed con- 
straining only the protein atoms. Inthe final equilibration stage, protein 
backbone alpha carbons constraints were applied for a further 60 ns. 

Productive trajectories in the canonical ensemble (NVT) at 300 K 
(four 500-ns-long replicas for each GLP-1R complex; Supplementary 
Table 3) were computed using a thermostat damping of 0.1 ps‘ with 
an integration time step of 4 fs and the M-SHAKE algorithm” to con- 
strain the bond lengths involving hydrogen atoms. The cut-off distance 
for electrostatic interactions was set at 9 A, with a switching function 
applied beyond 7.5 A. Long-range Coulomb interactions were handled 
using the particle mesh Ewald summation method (PME)” by setting 
the mesh spacing to1.0A. Trajectory frames were written every 100 ps 
of simulations. 


Molecular dynamics analysis 

The first half (500 ns) of the molecular dynamics replicas involving 
GLP-1R-TT-OAD2, GLP-IR-GLP-1 complexes as well as the apo-GLP-1IR 
(TT-OAD2), and apo-GLP-1R (GLP-1) systems (Supplementary Table 3) 
were considered as part of the equilibration stage and therefore not 
considered for analysis. Atomic contacts (atom distance less than 3.5 A) 
were computed using VMD*. Hydrogen bonds were identified using 
the GetContacts analysis tool (https://getcontacts.github.io/), withthe 
donor-acceptor distance set to 3.3 A and the angle set to 150°. Videos 
were generated using VMD* and avconv (https://libav.org/avconv. 
html). Root mean square fluctuation (RMSF) values were computed 
using VM“ after superposition of the molecular dynamic trajectories 
frames on the alpha carbon of the transmembrane domain (residues 
E138!3-V404’°°). The orientation of the N-terminal helix of the ECD 
of GLP-1R was drawn in VMD considering a representative frame every 
10 ns. To detect volumes within the transmembrane domain of GLP- 
1R occupied by water molecules with low mobility (structural water 
molecules), the AquaMMapS® analysis was performed on 10-ns-long 
molecular dynamics simulations of the GLP-1R-TT-OAD2-G, and 
GLP-1R-GLP-1-G, complexes (coordinates were written every 10 ps of 
simulation); all the alpha carbons were restrained in analogy with the 
approach proposed previously™. 


Whole-cell radioligand binding assays 

HEK293 cells (confirmed mycoplasma negative) were seeded at 30,000 
cells per well in 96-well culture plates and incubated overnight in DMEM 
containing 5% FBS at 37 °C, 5% CO,. Media was replaced with HBSS con- 
taining 25 mM HEPES and 0.1% (w/v) BSA with 0.1nM ™I-exendin(9-39) 
and increasing concentrations of unlabelled agonist. Cells were incu- 
bated overnight at 4 °C, washed three times in ice-cold buffer and then 
solubilized in 0.1 M NaOH. Radioactivity was determined by gamma 
counting. Non-specific activity was defined using 1 1M exendin(9-39). 


cAMP accumulation assays 

HEK293 cells (confirmed mycoplasma negative) were seeded at a den- 
sity of 30,000 cells per well into 96-well culture plates and incubated 
overnight in DMEM containing 5% FBS at 37 °C in 5% CO,. cAMP detec- 
tion was performed as previously described in the presence of the 
phosphodiesterase inhibitor 3-isobutyl-1-methylxanthin®. All values 
were converted to cAMP concentration using a cAMP standard curve 
performed in parallel and data were subsequently normalized to the 
response of 100 uM forskolin in each cell line. In one series of experi- 
ments, vehicle or increasing concentrations of TT-OAD2 was added 
30 min before assay of peptide response. 


cAMP kinetics studies 


HEK293A cells (confirmed mycoplasma negative) were transfected with 
an Epac-cAMP sensor (CAMYEL) and human GLP-1R at an optimized 


ratio. Ligand-mediated cAMP production was measured 48 h after 
transfection. In brief, culture media was replaced with assay buffer (1x 
HBSS, 10 mMMHEPES, 0.1% BSA, pH 7.4). BRET signals were measured at 
1 min intervals using a PHERAstar plate reader (BMG LabTech) in the 
absent or present of increasing concentration of ligands. Forskolin 
(100 1M) was used as a positive control, and data were normalized to 
the forskolin response. 


B-arrestin recruitment assays 

HEK293 cells (confirmed mycoplasma negative) were transiently 
transfected with GLP-1R-Rluc8 and B-arrestin1-Venus at a 1:4 ratio and 
seeded at a density of 30,000 cells per well into 96-well culture plates 
and incubated for 48 hin DMEM containing 5% FBS at 37 °C in 5% CO,. 
B-arrestin recruitment was performed as previously described®. In one 
series of experiments, vehicle or increasing concentrations of TT-OAD2 
was added 30 min before assay of peptide response. 


ERK1/2 phosphorylation assays 

HEK293 cells (confirmed mycoplasma negative) expressing stably 
expressing the GLP-1R were seeded at a density of 30,000 cells per well 
into 96-well culture plates and incubated overnight at 37 °C in5% CO,. 
Receptor-mediated pERK1/2 was determined using the AlphaScreen 
ERK1/2 SureFire protocol as previously described“. Data were normal- 
ized to the maximal response elicited by 10% FBS determined at 6 min. 
In one series of experiments, vehicle or increasing concentrations of 
TT-OAD2 was added 30 min before assay of peptide response. 


Ca‘ mobilization assays 

HEK293 cells (confirmed mycoplasma negative) stably expressing 
the GLP-1R were seeded at a density of 30,000 cells per well into 
96-well culture plates and incubated overnight at 37 °C in5% CO,, and 
receptor- mediated intracellular calcium mobilisation determined as 
previously described®. Fluorescence was determined immediately 
after ligand addition, with an excitation wavelength set to 485 nm 
and an emission wavelength set to 520 nm, and readings taken every 
1.36 s for 120 s. The peak value was used to create concentration- 
response curves. Data were normalized to the maximal response elic- 
ited by 100 pMATP. In one series of experiments, vehicle or increasing 
concentrations of TT-OAD2 was added 30 min before assay of peptide 
response. 


Generation of stable cell lines containing wild-type and mutant 
GLP-1R 

Mutant receptors were generated in a 2xc-Myc epitope-tagged recep- 
tor using QuikChange site-directed mutagenesis (Invitrogen) and 
sequences confirmed. Wild-type and mutant receptors were stably 
expressed in CHOFIpIn cells (confirmed mycoplasma negative) using 
the FlpIn Gateway technology system and selected using 600 pg mI 
hygromyocin B. 


NanoBRET ligand binding 

HEK293A cells were transiently transfected with Nluc-hGLP-1R. 
Forty-eight hours after transfection, cells were collected and plasma 
membrane was extracted as described previously*. Cell membrane 
(1 pg per well) was incubated with furimazine (1:1,000 dilution from 
stock) in assay buffer (1x HBSS, 10 mM HEPES, 0.1% (w/v) BSA, 1x P8340 
protease inhibitor cocktail, 1 mM DTT and 0.1 mM PMSF, pH 7.4). Rho- 
damineX-Ex4 (Rox-Ex4) was used as fluorescent ligand in the Nano- 
BRET binding assay. BRET signal between Nluc-hGLP-1R and Rox-Ex4 
was measured using PHERAstar (BMG LabTech) at 10 s interval (25 °C), 
a2 min baseline was taken before addition of Rox-Ex4 (Kj concentra- 
tion 3.16nM, determined previously), the measurement continued 
for 15 min followed by adding increasing concentration of TT-OAD2, 
or unlabelled Ex4 as a control. Data were corrected for baseline and 
vehicle treated samples. 


G-protein conformation assays 

HEK293AAS/Q/12/13 cells stably expressing GLP-1R (tested and 
confirmed to be free from mycoplasma) were transfected with a 
1:1:1 ratio of Nanoluc-Ga, (Nanoluc inserted at position 72): 
GB,:Venus-Gy, 24 h before collection and preparation of cell plasma 
membranes. Cell membrane (5 pg per well) was incubated with 
furimazine (1:1,000 dilution from stock) in assay buffer (1x HBSS, 
10 mM HEPES, 0.1% (w/v) BSA, 1x P8340 protease inhibitor cocktail, 
1mM DTT and 0.1mM PMSF, pH 7.4). The GLP-1R-induced BRET sig- 
nal between Ga, and Gy was measured at 30 °C using a PHERAstar 
(BMG LabTech). Baseline BRET measurements were taken for 2 min 
before addition of vehicle or ligand. BRET was measured at 15-s inter- 
vals for a further 7 min. All assays were performed ina final volume 
of 100 pl. 


G-protein NanoBIT assays 

HEK293A wild-type cells stably express human GLP-1R were transiently 
transfected with Ga-LgBIT, GB,, Gy,-SmBIT (1:5:5) 48 h before the assays. 
Cells were then incubated with coelenterazine H (5 uM) for1hat room 
temperature. Luminescence signals were measured using a Clariostar 
plate reader (BMG LabTech) at 30 s intervals before and after ligand 
addition (25 °C). Data were corrected to baseline and vehicle treated 
samples. 


In vivo IVGTT assays 

Intravenous glucose tolerance tests were performed in male human 
GLP-1R knock-in and knockout mice (all on C57/BL6 background”). 
Catheters were placed in the right carotid artery and left jugular vein of 
mice 6-11 months of age. Approximately one week later, mice (n=4-5 
per group) were fasted overnight and the catheters were exteriorized 
as mice acclimated to test cages. Vehicle (5% DMSO, 20% Captisol in 
NaHPO,, pH 2, 1 ml kg”), GLP-1(7-36)NH, at 10 pg kg™, GIP(1-42) at 25 
ugkg™, or OAD2 at3 mg kg ‘was administered intravenously one minute 
before glucose load (0.5 g kg). Blood samples were collected at -10, 
O, 2,4, 6,10, 20 and 30 min to determine blood glucose concentra- 
tions via glucometer (Roche, Aviva) and plasma insulin measurement 
(Alpco, 80-INSMSU-E10). All mouse experiments were performed in 
accordance with the Institutional Animal Care and Use Committee 
of Eli Lilly and Company and the NIH Guide for the Use and Care of 
Laboratory Animals. 


Data analysis 

Pharmacological data were analysed using Prism 7 (GraphPad). Con- 
centration response signalling data were analysed using a three-param- 
eter logistic equation, or via operational analysis. Changes in the rate 
of change in BRET kinetic data were fitted to one-phase association 
curve. Statistical analysis was performed with either one-way analysis 
of variance anda Dunnetts post-test or a paired t-test, and significance 
accepted at P< 0.05. 


Graphics 

Molecular graphics images were produced using the UCSF Chimera 
package from the Computer Graphics Laboratory, University of Cali- 
fornia, San Francisco (Supported by NIH P41 RR-01081). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

All relevant data are available from the authors and/or included inthe 
manuscript or Supplementary Information. Atomic coordinates and 
the cryo-EM density map have been deposited in the Protein Data Bank 
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(PDB) under accession number 6ORV and Electron Microscopy Data 
Bank (EMDB) accession EMD-20179. 
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Extended Data Fig. 1| Binding, transducer coupling and signalling mediated 
by TT-OAD2. a, Kinetic ligand-binding assay using ROX-exendin-4 as the 
fluorescent probe. TT-OAD72 is only able to partially displace the probe and 
with slower kinetics relative to exendin-4 that shows complete displacement of 
the probe with rapid kinetics. b, cAMP accumulation studies using GLP-1 and 
TT-OAD2as the agonist in wild-type HEK293 cells and HEK293 cells in which G, i; 
(AG, or all G,,;, (AG;/,./,) have been depleted using CRISPR-Cas9. c, HEK293A 
cells transiently transfected with the GLP-1R and the NanoBit constructs for Ga, 
and Ga;,, (Ga-LgBIT, Gy,-SmBIT). Luminescence signal was assessed over time 
(0-20 min) in the presence of increasing concentrations of GLP-land TT-OAD2. 
Concentration response curves are expressed as AUC (0-20 min) for each 
concentration and normalized to the negative response observed by GLP-1at 
1pM. d, Agonist-induced changes in trimeric G, protein conformation. Ligand- 
induced changes in BRET were measured in plasma membrane preparations 
performed in kinetic mode until kinetic equilibrium was reached for vehicle or 
increasing concentrations of GLP-1 (left) and TT-OAD2 (right). The addition of 
GTP dissociated the trimeric G protein complex stabilized by GLP-1-occupied 
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and TT-OAD2-occupied GLP-1R. e, Agonist-induced changes in trimeric G;, 
protein conformation. Left, ligand-induced changes in BRET were measured in 
plasma membrane preparations performed in kinetic mode until kinetic 
equilibrium witha saturating concentration of GLP-land TT-OAD2. The BRET 
signal decreased in the presence of GTP, which suggests that GTP dissociated 
the G,, protein complex stabilized by GLP-1-occupied and TT-OAD2-occupied 
GLP-1R. Quantification of the plateau (middle) and the rate of ligand-induced 
conformational change (right) for each agonist (1 uM GLP-1and10 pMTT- 
OAD2) was calculated by applying a one-phase association curve to the kinetic 
data with values from each individual experiment show in black circles. 

f, Concentration-response curves of production in live HEK293 cells 
expressing the GLP-1IR and an EPAC BRET biosensor in the presence of different 
concentrations of GLP-1and TT-OAD2. Left, cAMP response taken 25 min after 
ligand addition. Right, area under the curve (AUC) analysis of the response 
calculated as AUC across the full kinetic trace for each ligand concentration 
(from data in Fig. 2d). Dataare mean +s.e.m. of 4-6 independent experiments 
performed in duplicate or triplicate. 
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Extended Data Fig. 2| Purification, cryo-EMdataimagingandprocessingof — showing the overall nominal resolution at 3.0A.f, 3D histogram representation 


the TT-OAD2-GLP-1R-G, complex. a, Representative elution profile of Flag- of the Euler angle distribution of all the particles used for the inthe 

purified complex on Superdex 200 Increase 10/30 SEC. b, Representative reconstruction overlaid onthe density map drawn onthe same coordinate axis 
micrograph of the TT-OAD2-GLP-1R-G, complex. Red circles highlight (map is coloured according to local resolution as ing). g, Cryo-EM density map 
examples of individual particles. c, Two-dimensional class averages of the coloured according to resolution. Left, map with the GLP-1R ECD masked; right, 
complex in maltose-neopentyl glycol (MNG) micelle. d, Cryo-EM data map including the ECD of GLP-1R. 
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map and the model are shown for all seven transmembrane helices and helix 8 being the least well resolved region. 
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Extended Data Fig. 4| Cryo-EM density supports ligand interactions inthe density inthe cryo-EM map. c, Density for the ECD was visible in the cryo-EM 
TT-OAD2-binding site. a, Interacting residues predicted by LigPlot using the and supports extended interactions of the ECD with ECL1 and ECL2, as well as 
full-length model with ECD. b, The pose of TT-OAD2 and interactions with withthe ligand TT-OAD2. 

residues within TM1, TM2, TM3, ECL1and ECL2 are supported by well-resolved 


bundle 


Extended Data Fig. 5 | Comparison of the TT-OAD2-GLP-1R-G, complex with 
peptide agonist-bound GLP-1R structures and the inactive class BGPCR 
glucagon receptor transmembrane helices. a, Structures of agonist-bound 
GLP-1R; from left to right: GLP-1R (orange) bound to GLP-1 peptide (green) inan 
active conformation, GLP-1R (pink) bound to ExPS peptide (cyan) in an active 
conformation, GLP-1R (blue) bound to non-peptide TT-OAD2 (red) in an active 
conformation, GLP-1R (pale green) bound to 11-mer peptide HepPS (purple) ina 
partially active conformation. Far right, overlay of GLP-1R agonist-bound 
structures highlighting variations within the ECD position in the different 
structures. Inset, differences inthe location of the ECD are supported by 
density in the cryo-EM maps; shown are the GLP-1-bound (orange) and TT-OAD2 
bound (blue) GLP-1R. b,c, Various overlays of these structures (using the same 
colours) to compare conformational differences between the different 
structures. b, Overlay of TT-OAD2-bound GLP-1R G, structure with the full- 
length peptide bound G, structures and the inactive glucagon receptor (GCGR; 
grey) bundle reveals common conformational transitions occur in all agonist- 
bound structures relative to the inactive GCGR, but the extent of these 
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movements differ. A more open helical bundle is observed for the TT-OAD2- 
bound GLP-1R than either GLP-1- or ExP5-bound owing toa distinction inthe 
conformations of TM1, TM6, TM7 and ECL3 at the extracellular side of the 
receptor induced by the binding of the different ligands (left and middle). 
Middle, differences in the conformation of TM2 between the inactive and 
peptide-agonist-bound structures is also evident. Right, at the intracellular 
face all active structures display a similar large outward movement of TM6 and 
asmaller movement within TMS. c, Comparison of TT-OAD2-bound GLP-IR 
with the small peptide HepPS-bound GLP-1R structure. Left, TT-OAD2 and 
Hep-P5 occupy a partially overlapping binding site but promote distinct 
conformations of the ECD and transmembrane bundle of the GLP-1R. Middle, 
HepP5S engages deeper in the helical bundle than TT-OAD2 and promotes a 
more closed helical bundle owing to differences induced inthe conformation 
of TM1, TM6, TM7 and ECL3. Right, overlay of the TT-OAD2-, Hep-P5- and GLP-1- 
bound GLP-1R transmembrane bundles reveals HepP5 induces a similar 
conformation of the helical bundle to GLP-1 whereas TT-OAD2 induces a 
distinct conformation. 
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Extended Data Fig. 6 | Pharmacological responses exhibited byendogenous _ or increasing concentrations of TT-OAD2. Data were performed in HEK293A 
ligands GLP-1 and oxyntomodulin in the presence of TT-OAD2. Signalling cells stably expressing the GLP-1R, and are mean + s.e.m. of 3-4 independent 
profiles of GLP-Land oxyntomodulin, after 30 min preincubation of vehicle (0) experiments performed in duplicate. 
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Extended Data Fig. 7| GLP-1R domains are stabilized by either ligand b, GLP-1R contacts formed with membrane lipids during molecular dynamic 
contacts or lipid interactions. a, Top, RMSF values ofalphacarbonscomputed simulations of the GLP-1IR-TT-OAD2-G, and the GLP-IR-GLP-1-G, systems. Two 
during MD simulations of the GLP-IR-GLP-1-G, complex (black line) and the sides views of the receptor are shown (ribbon and transparent surface). When 
GLP-1R-TT-OAD2-G, complex (red line); transmembrane helices, intracellular bound to TT-OAD2, ECL1, TM3, the distal end of TM6, and ECL3 are more in 
loops (ICLs), and ECLs positions are indicated. Bottom left, RMSF values contact with the membrane lipids (magenta). By contrast, TMland TM7 are 
plotted on the GLP-1R structure bound to GLP-1 (transparent ribbon). Bottom more prone to interact with the membrane when GLP-1is bound (green). The 
right, RMSF values plotted on the GLP-1R structure bound to TT-OAD2 outward movement of ECL3 in the GLP-1IR-T T-OAD2-G, complex (stabilized by 
(transparent stick representation). ECL1 and ECL3 were more dynamic in the a hydrogen bond network different than GLP-1IR-GLP-1-G,; Extended Data 
GLP-1-bound receptor than the TT-OAD2-bound structure. By contrast, ECL2 Table 2) produces more interactions with the lipids, possibly further stabilizing 


and the top end of TMS were more mobile in the GLP-IR-T T-OAD2-G, complex. the open conformation of TM6, ECL3 and TM7. 
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Extended Data Fig. 8 | Dynamics of the ECD of GLP-1R. a, The vector (shown 
here asa green arrow) connecting $49*° and E34‘ alpha carbons (ECD 
N-terminal helix) are shown in the box. b, Left, ECD N-terminal helix 
orientations observed during the molecular dynamics simulation of the GLP- 
1R-GLP-1-G, (black arrows), the GLP-IR-GLP-1 complex (obtained by removing 
G protein; blue arrows), and the apo-GLP-1R (obtained by removing both the G, 
protein and GLP-1; cyan arrows) are shown onthe left viewed from the top and 
side of the bundle. The receptor is shown asa dark grey ribbon. During 
molecular dynamic simulations with GLP-1 bound, the N-terminal helix was 
oriented vertically (black and blue arrows), whereas in the apo-form the ECD 
N-terminal helix was more dynamic and experienced both openand closed 
conformations (this is analogous to the suggested ECD dynamics for the 
glucagon receptor). Right, ECD N-terminal helix orientations of the GLP-IR-TT- 
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OAD2-G, (red arrows), the GLP-IR-T T-OAD2 complex (obtained by removing 
Gprotein; orange arrows), and the apo-GLP-1R (obtained by removing both the 
G, protein and TT-OAD2; yellow arrows) are shown. The receptor is shown 

as aredribbon. The distal end (S49) of the helix was more mobile than the 
proximal one (E34*), which had an overall tendency to remainin the proximity 
ofthe TT-OAD2-binding site, driven by transient interactions with the ligand 
(Extended Data Table 1) and hydrogen bonds with the R299®? side chain 
(Extended Data Table 2). Molecular dynamics simulations therefore suggest a 
different behaviour for residue R299*2, which is stably involved in 
interactions with the peptide in the GLP-1-R-GLP-1-G, complex (Extended Data 
Table 1), and instead interacts with E34*° and other residues located at the 
ECL2 (E294®, D293 and N300*) in the GLP-IR-T T-OAD2-G, complex 
(Extended Data Table 2). 
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Extended Data Fig. 9 | Proposed activation mechanism of class BGPCRs. 
Left, inthe inactive conformation, the top of the transmembrane domain is 
stabilized by interactions of the ECD with the TM6-ECL3-TM7 region. Top, 
activation of class BGPCRs by peptides occurs viaa two domain mechanism. 
Top left, engagement of the peptide with the receptor ECD releases ECD 
constraints on the transmembrane domain promoting outward movements of 
TMI, TM6 and TM7 by peptide. Middle, interaction of the peptide N terminus in 
the bundle within TM1, TM2, TM3, TMS, TMéand TM7 promotes TM1, TM6 and 
TM7 toclose in around the peptide. Direct engagement of peptides with the 
central polar network facilitates conformational transitions required for G 


protein coupling and activation. Top right, the active conformation of the 
central polar network is stabilized by a series of structural waters. Bottom, 
interaction of the non-peptide TT-OAD2at the top of the GLP-IR 
transmembrane bundle releases ECD constraints on the transmembrane 
bundle resulting in movements of TM1, TM6 and TM7 outwards. TT-OAD2 does 
not engage TM5-TM7 and the bundle remains open. TT-OAD allosterically 
promotes conformational rearrangement of the central polar network to 
stabilize the fully active receptor conformation that allows coupling to G 
protein. Bottom right, the central polar network is stabilized by a distinct 
network of structural waters relative to peptide-mediated activation. 
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Extended Data Table 1| GLP-1R-TT-OAD2 and GLP-1R-GLP-1 contacts during molecular dynamics simulations performed on 
the GLP-1R-TT-OAD2-G, and GLP-1R-GLP-1-G, complexes 


GLP-1R:TT-OAD2 Contacts occupancy (% GLP-1R:GLP-1 Contacts occupancy (% 
(TM) Residues frames) (TM) Residues frames) 
*K 1977-67 100.0 *RIQGRE 302.9 
A20077 100.0 L388)" 288.0 
Yo 100.0 P20 273.2 
#HO3(0% 49 99.9 E384)" 266.2 
*M2047-74 99.8 ¥205 261.3 
*w297hC2 99.8 Lio" 222.5 
W203? 99.7 pia 196.2 
C2267" 99.7 ey 145 193.2 
*M233336 99.5 F3817°6 187.4 
229332 99.2 R1902-° 151.1 
L201-" 98.9 *K 197767 140.8 
Laie 98.1 *M233776 139.1 
11967. 88.6 E3877 132.1 
*Y145)40 68.9 *Y 148143 131.2 
O221="" 67.4 | WA aa 124.1 
A1997-69 52.4 O213" 111.8 
Q234337 48.6 E36463 103.7 
D1987-° 44.4 T3917-46 101.3 
T2959) 43.9 w214Fcul 93.2 
*Y 148143 43.4 V2370%° 89.6 
*V 1942-64 42.6 #2307? 87.3 
So1h? 41.5 120g" 86.6 
C2962 40.3 *V 1947.64 81.0 
W33ECD 38.7 yar 80.3 
A28ECD 36.4 W306°*¢ 76.6 
L.3a2Fe? 25.1 R376) 69.1 
Ti49 23.6 sl PAY jas 67.0 
*R299FC2 22.0 F3676*° 62.6 
*T290gRCl2 20.1 R310°4° 59.6 
E3s5-" 57.0 
Bie? 56.7 
*M204ECH 52.7 


Contacts involving the GLP-1R transmembrane domain are determined as the sum of the occupancy (reported as percentage of frames) of all the contacts involving each residue. Values higher 
than 100% indicate residues able to interact with more than one peptide side chain. A contact was considered productive if the distance between the residue and the ligand was less than 

3.5 A. Data are summarized in Fig. 4c. TT-OAD2 mainly interacted with TM2, ECL1 and TM3. Interactions with TM1 and ECL2 were present but not persistent (with the exception of W297"), 

The N-terminal helix of the ECD was engaged in (many) transient interactions. Overall, GLP-1 interacted with a different set of residues and was able to further involve TM5, TM6 and TM7. 
TT-OAD2 and GLP-1 common contact residues (indicated by an asterisk) were located at TM1, TM2 and ECL2. Ligand contacts formed via interaction with the receptor backbone rather than a 
side chain interaction are indicated by a hash symbol. 


Extended Data Table 2 | Main GLP-1R-GLP-1R intramolecular hydrogen bonds during molecular dynamics simulation 


GLP-1R:GLP-1:Gs 
GLP-1R_ residues involved in| GLP-1R:TT- GLP-1R:TT-OAD2:Gs 
intramolecular hydrogen bonds OAD2:Gs GLP-IR:GLP-1:Gs | A occupancy a 


N182?*2 Ww2744° 73 81.4 8.4 


See | 


R190? 24134 13.9) 14.2 0.3 


D19828 Oey 
D1982-68 K1972-7 
ROD ie D222FCH 
Re sa 
E2473 ie 
D293 K288*4 
E2942C12 S301 ECU 
E294EC2 K28844 
R299FCL2 E2942C12 
R299FC12 N3005C!2 
R299ECL2 E373&CL3 


D372EC3 K3837: 38 


R37 6ECL3 D3 TQECL3 


Data expressed as the occupancy (percentage of frames) in which the interactions were present in the GLP-1R-TT-OAD2-G, and GLP-1R-GLP-1-G, complexes. Differences between 
GLP-1R-GLP-1-G, and GLP-1R-TT-OAD2-G, are reported in the right column; green indicates more contacts in GLP-1 versus TT-OAD2, and red denotes more contacts in TT-OAD2 versus GLP-1. 
Grey cells indicate that hydrogen bonds were not present. 
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Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection SerialEM 
Data analysis GraphPad Prism, UCSF Chimera, VMD, GetContacts analysis tool, avconv, AquaMMapS, ACEMD, Modeller, MDFF, Coot, Phenix, Chem3D, 
Relion 3.0, CryoSPARC 2.5.0, ResMAP, MotionCor2, Gctf, EMAN2 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Atomic coordinates and the cryo-EM density map have been deposited in the Protein Data Bank under accession number 6ORV and EMDB entry ID 20179 
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Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample size calculation was not required 
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Data exclusions No data were excluded 
Replication Experimental findings were reliably reproduced 


Randomization Randomization was not required 


Blinding Blinding was not performed 


Reporting for specific materials, systems and methods 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Unique biological materials ChIP-seq 
Antibodies Flow cytometry 
Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


[| Human research participants 


Unique biological materials 


Policy information about availability of materials 


are readily avaliable 


Obtaining unique materials — De: 


Antibodies 
Antibodies used rabbit anti-Gs C-18 antibody (cat no sc-383), Santa Cruz 
mouse Penta-His antibody (cat no 34660), QIAGEN 
680RD goat anti-mouse antibody (LI-COR) 
800CW goat anti-rabbit antibody (LICOR) 
Validation All antibodies were used for Western blot analysis and have been validated. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) Cells used in assays were obtained from ATCC 
Authentication No authentication required 


Mycoplasma contamination Cell lines were tested and are free from mycoplasma contamination 


Commonly misidentified lines Cells are not listed in the database 
(See ICLAC register) 


Palaeontology 


Specimen provenance Not applicable 
Specimen deposition Not applicable 
Dating methods Not applicable 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 
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Laboratory animals C57/Bl6 
Wild animals Not applicable 
Field-collected samples Not applicable 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Not applicable 


Recruitment Not applicable 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


| | Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links Not applicable 
May remain private before publication. 


Files in database submission Not applicable 
Genome browser session Not applicable 
(e.g. UCSC) 


Methodology 


Replicates ot applicable 
Sequencing depth ot applicable 
Antibodies ot applicable 
Peak calling parameters ot applicable 
Data quality Not applicable 
Software ot applicable 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 
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ethodology 

Sample preparation ot applicable 
Instrument ot applicable 
Software ot applicable 


Cell population abundance ot applicable 


Gating strategy ot applicable 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type ot applicable 
Design specifications ot applicable 
Behavioral performance measures ot applicable 
Acquisition 
maging type(s) ot applicable 
Field strength ot applicable 
Sequence & imaging parameters ot applicable 
Area of acquisition ot applicable 
Diffusion MRI Used Not used 
Preprocessing 
Preprocessing software ot applicable 
Normalization ot applicable 
Normalization template ot applicable 
Noise and artifact removal ot applicable 
Volume censoring ot applicable 


Statistical modeling & inference 


Model type and settings ot applicable 
Effect(s) tested ot applicable 
Specify type of analysis: Whole brain ROI-based Both 
Statistic type for inference ot applicable 


(See Eklund et al. 2016) 


Correction Not applicable 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


Graph analysis 


Multivariate modeling or predictive analysis 
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Advice, technology and tools 


Work 


Send your careers story 
to: naturecareerseditor 
@nature.com 


PhD student Ty Tang in a gyroscope lab at Arizona State University in Tempe. 


WHAT I LEARNT FROM 
ASTINT ININDUSTRY 


Two years of work before graduate school gave me confidence and technical 
Skills, and made me certain that research was the right choice. By Ty Tang 


f you want to goto graduate school, 
don’t go to work first. You'll get 
addicted to the money and you 
won't be able to go back.” 

Ilistened to these words of advice 
carefully. | had spent the past several months 
asking people about their experiences of 
graduate school. The overwhelming majority 
of advice related to money. “It’s not worth the 
increase in salary”, “you'll beso poor’, “you'll 
envy your friends with jobs”. But, | wondered, 


aa 


what about the love for research and learning? 

Like many other students, I was confronted 
with a predicament: at a turbulent stage of 
my life, with uncertainty around every cor- 
ner, | was torn between applying to graduate 
schools or for full-time jobs. Like many stu- 
dents, I was terrified of committing to one or 
the other, sol applied to both. And withsome 
luck and persistence, | was offered opportuni- 
ties for both, putting me back to square one. 

One day, as my decision deadlines 
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approached, I had a realization: if I] started 
to work and became ‘addicted to the money’, 
then graduate school probably wasn’t the 
right decision to start with. Why waste up 
to six years of my life on a career path I 
was uncertain about if, in the end, money 
was worth more to me than my passion for 
science? It wasn’t exactly the intent behind 
the advice, but it nonetheless allowed me 
to proceed with some clarity. After fin- 
ishing my last undergraduate term of a 
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Work / Careers 


Ty Tang (second from right) made great friends at Epic Systems in Verona, Wisconsin. 


bioengineering degree at the University of 
Illinois at Chicago, | embarked on a journey 
to work as an interface engineer at Epic 
Systems, a health-care software company 
in Verona, Wisconsin. 


Defy expectations 


Being an interface engineer at Epic Systems 
required new skills: project management, 
writing code, working with customers and 
troubleshooting problems. | also had to 
adjust to a lack of autonomy, because what 
I did at work depended on the customers’ 
and the company’s needs. In more cynical 
moments, I imagined that I might gradu- 
ally begin to hate my job, be miserable for a 
while and regret ever choosing industry over 
academia. 

I was surprised by how wrong | was. I had 
the good fortune to make great friends, be 
surrounded by smart and driven people and 
was finally able to buy groceries without 
financial stress. But the most incredible thing 
for me was that I actually enjoyed my job. Like 
everything, it had its ups and downs, but I was 
surprised by how engaging non-academic full- 
time employment could be. I applied the same 
discipline and creative problem-solving I had 
developed as an undergraduate researcher 
and taught myself new skills as I grew pro- 
fessionally. | made contributions that felt 
meaningful, and I enjoyed working with my 
peers and customers. I looked forward to the 
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challenges and opportunities, and, overall, | 
felt happy. Working there made me realize 
some important things. 

Graduate school is often overly romanti- 
cized.I’mas guilty of this as anyone, but now 
I understand that not all research is fun and 
stimulating, and that not all work is tedious 
and soulless. 


“For me, taking a detour 
from research wasn’t a waste 
of time — not by along shot.” 


The people around you matter. Sometimes, 
even more than where you are. I found that 
being surrounded by smart, kind and passion- 
ate people makes everything easier. I learnt 
to cherish my personal relationships and the 
importance of prioritizing them over work. 

Money can buy youa lot of happiness. Some 
might argue that happiness as a result of buy- 
ing things is fleeting and shallow, but, for me, 
the value of removing financial burdens — of 
going out to eat without feeling guilty and 
being able to travel and have new experiences 
— can’t be overstated. It’s not that you canbe 
happy only if you have money, but it certainly 
helps. 

Money isn’t everything. Everyone knows 
this, but sometimes it’s hard to accept. When 
people look on social media and see their 


© 2020 Springer Nature Limited. All rights reserved. 


friends or colleagues going on extravagant 
holidays, it’s easy to think that if you could 
just win the lottery, you’d never have to worry 
about anything ever again. But, of course, it’s 
not that simple. There are things other than 
money that matter to us. 

For me, what money couldn’t buy was pas- 
sion. Eventhough! was happy, | couldn’t help 
but feel that something was missing, and that 
I wanted to do more. It wasn’t long before I 
started looking into graduate programmes 
again. Having heard me talk passionately 
about my research experiences, my boss 
wasn't at all surprised to hear my decision. 


Time to return 


After 18 months at Epic, I started going for 
interviews at graduate schools, and a few 
months later, laccepted an offer from Arizona 
State University in Tempe to study cognitive 
science. Armed with my experience in pro- 
ject management and technical writing, | 
felt much more confident and prepared for 
the challenges that awaited me in graduate 
school than I had before. | also couldn’t wait 
to apply the technical skills I’d picked up, 
and use my determination and ambition in 
another arena. 

For me, taking a detour from research 
wasn’t a waste of time — not by a long shot. 
Not only did I meet incredible people and 
improve my working habits, but I’m now 
certain that 1am where! want to be. 

When | was asking for advice about grad- 
uate school, I wish I'd been told that going 
to graduate school is not a decision you 
should rush: it’s a huge commitment and 
can be financially stressful. If you’re not sure 
whether you want to be a scientist, it’s okay 
to take some time to do something else. You 
might end up realizing that graduate school 
wasn’t the right decision for you, and that’s 
absolutely fine. 

lalso wish someone had told me not to get 
tunnel vision too early in my career. | recom- 
mend you explore your options. Try things 
you think you might like, but also try things 
you might not like. Your experiences will give 
you perspectives and skills that can give you 
an edge over your competition. 

No matter what you do, there are many 
opportunities to learn and to contribute to 
the world. Evenif you don’t workin research, 
the skills you develop as a student and ascien- 
tist will allow you to make positive changes. In 
the end, what you have to decide for yourself 
is where you want to make those changes and 
what really matters to you. 

Taking a break before graduate school 
didn’t weaken my resolve, it strengthened 
it, and! don’t think I would have been able to 
achieve what I have without that experience. 


Ty Tang is a cognitive-science PhD student at 
Arizona State University in Tempe. 
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ELEVEN TIPS FOR WORKING 
WITH LARGE DATASETS 


Big data are difficult to handle. These tips and tricks 
can smooth the way. By Anna Nowogrodzki 


ig data are everywhere in research, 
and the data sets are only getting 
bigger — and more challenging to 
work with. Unfortunately, says Tracy 
Teal, it’s a kind of labour that’s too 
often left out of scientific training. 
“It’s a mindset,” says Teal, “treating data as 
a first-class citizen.” She should know: Teal 
was until last month the executive director 
of The Carpentries, an organization in Oak- 
land, California, that teaches coding and data 
skills to researchers globally. She says there’s 
a tendency in the research community to dis- 
miss the time and effort needed to manage 
and share data, and not to regard it as a real 
part of science. But, she suggests, “we can shift 
our mindset to valuing that work as a part of 
the research process”, rather than treating it 
as an afterthought. 


Here are 11 tips for making the most of your 
large data sets. 


Cherish your data. “Keep your raw data raw: 
don’t manipulate it without having a copy,” 
says Teal. She recommends storing your data 
somewhere that creates automatic backups 
and that other laboratory members can 
access, while abiding by your institution’s 
rules on consent and data privacy. 

Because you won't need to access these data 
often, says Teal, “you can use storage options 


“We can shift our mindset to 
valuing that work as a part of 
the research process.” 
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where it can cost more money to access the 
data, but storage costs are low” — for instance, 
Amazon’s Glacier service. You could even store 
the raw data on duplicate hard drives kept in 
different locations. Storage costs for large data 
files can add up, so budget accordingly. 


Visualize the information. As data sets get big- 
ger, new wrinkles emerge, says Titus Brown, a 
bioinformatician at the University of Califor- 
nia, Davis. “At each stage, you’re going to be 
encountering new and exciting messed-up 
behaviour.” His advice: “Do a lot of graphs and 
look for outliers.” Last April, one of Brown’s stu- 
dents analysed transcriptomes — the full set of 
RNA molecules produced by a cell or organism — 
from 678 marine microorganisms suchas plank- 
ton (L. K. Johnson et al. GigaScience 8, giy158; 
2019). When Brown and his student charted 
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average values for transcript length, coverage 
and gene content, they noticed that some values 
were zero — showing where the computational 
workflow had failed and had to be re-run. 


Show your workflow. When particle physicist 
Peter Elmer helps his 11-year-old son with his 
mathematics homework, he has to remind him 
to document his steps. “He just wants to write 
down the answer,’ says Elmer, who is executive 
director of the Institute for Research and Inno- 
vation in Software for High Energy Physics at 
Princeton University in NewJersey. Researchers 
working with large data sets can benefit fromthe 
same advice that Elmer gave his son: “Showing 
your workis as importantas getting tothe end.” 
This means recording your entire data work- 
flow — which version of the data you used, the 
clean-up and quality-checking steps, and any 
processing code you ran. Such information is 
invaluable for documenting and reproducing 
your methods. Eric Lyons, a computational 
biologist at the University of Arizona in Tuc- 
son, uses the video-capture tool asciinema 
to record what he types into the command 
line, but lower-tech solutions can also work. 
A group of his colleagues, he recalls, took 
photos of their computer screen’s display and 
posted them on the lab’s group on Slack, an 
instant-messaging platform. 


Use version control. Version-control systems 
allow researchers to understand precisely how 
a file has changed over time, and who made 
the changes. But some systems limit the sizes 
of the files you can use. Harvard Dataverse 
(which is open to all researchers) and Zenodo 
can be used for version control of large files, 
says Alyssa Goodman, an astrophysicist and 
data-visualization specialist at Harvard Uni- 
versity in Cambridge, Massachusetts. Another 
option is Dat, a free peer-to-peer network for 
sharing and versioning files of any size. The sys- 
tem maintains a tamper-proof log that records 
allthe operations you perform on your file, says 
Andrew Osheroff, a core software developer at 
Dat in Copenhagen. And users can direct the 
system to archive a copy of each version of a 
file, says Dat product manager Karissa McK- 
elvey, who is based in Oakland, California. Dat 
is currently acommand-line utility, but “we’ve 
been actively revamping”, says McKelvey; the 
team hopesto release a more user-friendly front 
end later this year. 


Record metadata. “Your data are not useful 
unless people — and ‘future you’ — know what 
they are,” says Teal. That’s the job of metadata, 
which describe how observations were col- 
lected, formatted and organized. Consider 
which metadata to record before you start 
collecting, Lyons advises, and store that infor- 
mation alongside the data — either in the soft- 
ware tool used to collect the observations or in 
aREADME or another dedicated file. The Open 
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Connectome Project, led by Joshua Vogelstein, 
aneurostatistician at Johns Hopkins University 
in Baltimore, Maryland, logs its metadata ina 
structured plain-text format called JSON. What- 
ever your strategy, try to think long-term, Lyons 
says: you might one day want to integrate your 
data with those of other labs. If you're proactive 
with your metadata, that integration will be 
easier down the line. 


Automate, automate, automate. Big data 
sets are too large to comb through manually, 
so automation is key, says Shoaib Mufti, sen- 
ior director of data and technology at the Allen 
Institute for Brain Science in Seattle, Washing- 
ton. The institute’s neuroinformatics team, 
for instance, uses a template for brain-cell and 
genetics data that accepts information only in 
the correct format and type, Muftisays. When 
it’s time to integrate those data into a larger 


“Our entire suite of software 
tools to validate and ingest 
data runs inthe cloud, which 
allows us to easily scale.” 


database or collection, data-quality assurance 
steps are automated using Apache Spark and 
Apache Hbase, two open-source tools, to val- 
idate and repair data in real time. “Our entire 
suite of software tools to validate and ingest 
data runs inthe cloud, which allows us to easily 
scale,” he says. The Open Connectome Project 
also provides automated quality assurance, 
says Vogelstein — this generates visualizations 
of summary statistics that users can inspect 
before moving forward with their analyses. 


Make computing time count. Large datasets 
require high-performance computing (HPC), 
and many research institutes now have their 
own HPC facilities. The US National Science 
Foundation maintains the national HPC net- 
work XSEDE, which includes the cloud-based 
computing network Jetstream and HPC centres 
across the country. Researchers can request 
resource allocations at xsede.org, and create 
trial accounts at go.nature.com/36ufhgh. 
Other options include the US-based ACI-REF 
network, NCI Australia, the Partnership for 
Advanced Computing in Europe and ELIXIR 
networks, as well as commercial providers such 
as Amazon, Google and Microsoft. 

But when it comes to computing, time is 
money. To make the most of his computing 
time on the GenomeDK and Computerome 
clusters in Denmark, Guojie Zhang, a genomics 
researcher at the University of Copenhagen, 
says his group typically runs small-scale tests 
before migrating its analyses to the HPC net- 
work. Zhang is a member of the Vertebrate 
Genomes Project, which is seeking to assem- 
ble the genomes of about 70,000 vertebrate 
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species. “We need millions or even billions of 
computing hours,” he says. 


Capture your environment. To replicate an 
analysis later, you won't just need the same 
version of the tool you used, says Benjamin 
Haibe-Kains, a computational pharmaco- 
genomicist at the Princess Margaret Cancer 
Centre in Toronto, Canada. You'll also need 
the same operating system, and all the same 
software libraries that the tool requires. For 
this reason, he recommends working in a 
self-contained computing environment — a 
Docker container — that can be assembled 
anywhere. Haibe-Kains and his team use the 
online platform Code Ocean (which is based 
on Docker) to capture and share their virtual 
environments; other options include Binder, 
Gigantum and Nextjournal. “Ten years from 
now, you could still run that pipeline exactly 
the same way if you need to,” Haibe-Kains says. 


Don’t download the data. Downloading 
and storing large data sets is not practical. 
Researchers must run analyses remotely, close 
to where the data are stored, says Brown. Many 
big-data projects use Jupyter Notebook, which 
creates documents that combine software 
code, text and figures. Researchers can ‘spin 
up’ such documents on or near the data servers 
to do remote analyses, explore the data, and 
more, says Brown. Jupyter Notebook is not par- 
ticularly accessible to researchers who might 
be uncomfortable using a command line, 
Brown says, but there are more user-friendly 
platforms that can bridge the gap, including 
Terra and Seven Bridges Genomics. 


Start early. Data management is crucial even 
for young researchers, so start your training 
early. “People feel like they never have time to 
invest,” Elmer says, but “you save yourselftime 
in the long run”. Start with the basics of the 
commandline, plus a programming language 
suchas Python or R, whichever is more impor- 
tant to your field, he says. Lyons concurs: “Step 
one: get familiar with data from the command 
line.” In November, some of his collaborators 
who were not fluent in command-line usage 
had trouble with genomic data because 
chromosome names didn’t match across all 
their files, Lyons says. “Having some basic 
command-line skills and programming let 
me quickly correct the chromosome names.” 


Get help. Helpis available, online and off. Start 
with the online forum Stack Overflow. Consult 
your institution’s librarians about the skills you 
need andthe resources you have available, Teal 
advises. And don’t discount on-site training, 
Lyons says: “The Carpentries is a great place 
tostart.” 


Anna Nowogrodzki is a journalist based near 
Boston, Massachusetts. 
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Carla Shatz 


Photographed for Nature by 
Gabriela Hasbun. 


442 | Nature | Vol577 | 16 January 2020 


NOLS WhstA~ 


) 


his office is my hut, where! do lots of 
brainstorming to go over laboratory 
members’ experiments and results. 
It’s where ideas get put into action 
or put down onto paper. People 

say it’s cosy. 

Everyone would call me a compulsive 
person. My desk is clean, and so are my desk 
drawers. The order of my desk symbolizes 
an orderly mind. Because of that, my mind 
has room to notice things that don’t fit 
the pattern. This is partly why my lab has 
been able to think outside the box and 
has worked on themes that don’t fit the 
expected. 

lama neurobiologist who studies how 
mammalian brain circuits are tuned by 
experience and neural activity during 
development. People thought these circuits 
should all be hard-wired, so we were very 
surprised to find years ago that the adult 
patterning of a visual circuit in the brain 
is not there initially in the embryo. More 
recently, we got another shock when we 
found that these circuits are remodelled 
by molecules that are typically thought 
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of as components of the immune system. 
Now, we are working on how these immune 
molecules are somehow being used by 
neurons to stabilize or prune their synapses, 
or connections. 

Above my computer are pictures of my 
‘labsters’, as I call my lab members from 
over the years, and some of my mentors. 
There’s a special one of me and Eve Marder 
when we shared the 2016 Kavli Prize in 
Neuroscience, along with that year’s seven 
other prizewinners in various fields — all 
men in tuxedos. I like mixing the plaques of 
recognition with my lab family, because you 
don’t have one without the other. 

We control so little in life, and it gives mea 
kind of perverse pleasure to have my office 
under control. Generally, my lab members 
are amused by my neat-freak mentality. But 
the best science is done when people are 
extremely careful and precise. 


Carla Shatz is a biologist and neurobiologist 
and director of the Stanford Bio-X 
interdisciplinary centre at Stanford University 
in California. Interview by Kendall Powell. 
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The government is betting its manufacturing future on 
smart machinery and artificial intelligence to improve 
product quality and flexibility. By Sarah O'Meara 


n 2016, industrial engineer Chen-Fu Chien 
was aSked to lead a university research 
centre in Taiwan that would develop 
new manufacturing technologies using 
artificial intelligence (Al). 

Rather than aiming to publish academic 
papers, his brief was to produce ideas that could 
be quickly transferred into industrial settings, 
says Chien. His research at the National Tsing 
Hua University (NTHU) in Hsinchu City uses 
big-data analytics to make machines smarter 
through Al that lets them take decisions without 
human control. It is one of several approaches 


to creating ‘smart factories’ that use an inter- 
connected, digital network of supply systems 
— part of Taiwan’s push to improve the flexibil- 
ity, quality and efficiency of its manufacturing. 

“lam one of the few senior scientists in 
Taiwan who’s worked extensively with busi- 
ness, as well as in public research. It’s one of 
the reasons the government asked me to lead 
the project,” says Chien, whose position at 
the NTHU is endowed by the US firm Micron 
Technology in Boise, Idaho, which develops 
computer memory and storage technologies. 

Chien’s mission is a sign of how Taiwan’s 
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government wants its manufacturing industry 
to change using technologies such as cloud 
computing, big data, the Internet of Things and 
smart robots — a shift in industrial practices 
that has been dubbed Industry 4.0. Once known 
asa hub for mass-produced cheap goods, such 
as toys and electronics carrying the ubiquitous 
‘Madein Taiwan’ stamp, the island is looking to 
science to upgrade its image soit can becomea 
destination for international companies search- 
ing for futuristic manufacturing solutions. 

In 2018, Chien and his team opened 
the Artificial Intelligence for Intelligent 
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Manufacturing Systems (known as AIMS) 
research centre in Hsinchu. The laboratory 
has more than 50 staff members and Chien 
is responsible for 10 related projects in other 
research institutes across Taiwan. It is part of 
a wider initiative begun in 2018 that includes 
three further facilities developing Al technol- 
ogies in areas such as financial technology, 
health care and intelligent transportation 
systems. The whole initiative will cost the 
Taiwanese government around US$33 million 
over 5 years. 

“The Ministry of Science and Technology 
wanted our centre to help create the next gen- 
eration of intelligent manufacturing systems 
that could only be found in Taiwan,” Chien 
says. The ministry’s aim is “to use the region’s 
strength in electronics manufacturing to its 
best advantage and establish Taiwan as a key 
high-tech manufacturing hub’. 

Taiwan’s efforts to change its manufacturing 
model are timely. A global slowdown in trade 
since 2011 and a tariff war on goods traded 
between mainland China and the United States 
have pushed companies to look for alternative 
manufacturing options that are flexible, effi- 
cient and unaffected by such economic tussles. 


Diverse development 


Taiwan has been a leading manufacturer of 
electronic components since the 1990s. Its 
economy remains reliant on an industry that 
is led by the world’s largest contract electronic 
chipmaker, Taiwan Semiconductor Manufac- 
turing Company (TSMC), which supplies tech- 
nology companies such as Apple and Huawei 
and contributed more than 4% to the region’s 
gross domestic product in 2018. 
However, the growth of consumer electron- 
ics has slowed across the world in the past few 
years: many people already have smartphones, 
and so fewer are being bought. In 2016, Tai- 
wan’s newly inaugurated president, Tsai Ing- 
wen, announced that the government would 
promote a new model of economic devel- 
opment. The idea was to encourage local 


technology firms to diversify their products 
and to become more innovative and self-suffi- 
cient to boost technology ties with the United 
States and Japan. Taiwan also wants to reduce 
its reliance on mainland China (see ‘Moving 
money’). 


“Companies need to 
make manufacturing 
more intelligent so it can 
be more flexible.” 


Tsai’s 2016 strategy was followed by a 
breakneck series of policy announcements 
to encourage investment in smart machinery — 
equipment that can work with less input from 
an expensive human controller — and in other 
manufacturing technologies (see ‘Non-stop 
reforms’). 

When Taiwanese manufacturers began 
moving factories to mainland China in the 
2000s, it harmed the development of smart 
manufacturing technology on the island, 
explains Stephen Su, who runs a centre at 
Taiwan’s Industrial Technology Research Insti- 
tute (ITRI), a government-funded research 
and development centre in Hsinchu. The 
institute, founded in 1973, has acted as an 
incubator for several Taiwanese companies, 
including the TSMC. 

Nowthe government is “pouring resources 
into smart manufacturing “because it’s the 
future of production’, Su says. 

A conventional moving assembly line — 
many people using tools to complete small 
tasks ina much larger, complex process — was 
pioneered by Henry Ford to manufacture auto- 
mobiles in the United States in 1913. Invented 
at the end of the ‘second industrial revolution’ 
that saw the global spread of technologies 
such as electrical power, the assembly line is 
still used in many factories today, says Chien. 

Machines have largely replaced workers 
since the advent of the computer age, which 


” 


saw athird revolution in industry involving 
robotics and greater automation. The next 
development, known as the fourth industrial 
revolution or Industry 4.0, will use advances in 
cyber-physical systems, such as optical-fibre 
sensors on machines. These will collect and 
exchange data that can be processed by big- 
data analytics and Al technologies, enabling 
manufacturers to make flexible decisions 
about how they operate and to allocate 
resources efficiently to empower smart pro- 
duction. Taiwan is betting that the products 
of the future will be made by such intelligent 
machinery. 

Companies around the world are similarly 
re-evaluating where and how they make their 
products, says Jason Ho, general manager of 
Avectec in Zhubei City near Hsinchu, which 
offers conventional manufacturers a software 
platform to help create smart factories. In 
these, networked machines can detect their 
own faults, work more efficiently and reduce 
production costs. 

“Particularly in high-tech areas such as 
the computer industry, information and 
communications technology and consumer 
electronics, companies don’t need to focus 
onmaking more products more quickly. They 
need to make manufacturing more intelligent 
so it can be more flexible. That way, compa- 
nies can quickly adjust the product to meet the 
demands of each customer,” Ho says. 


Talent base 


Nowthat Taiwan is remaking itself as a destina- 
tion for the next generation of manufacturers, 
there’s one thing it requires more of: talent. 
Itisin urgent need of experienced engineers, 
both to design smart manufacturing technolo- 
gies andtocreate the high-tech products of the 
future, says Su. “We must invest in our scien- 
tists and engineers. There are many countries 
in southeast Asia that are also becoming more 
sophisticated in terms of manufacturing, and 
to stay competitive, it’s important to make 
investments in education,” says Su. 


NON-STOP 
REFORMS 
TAIWAN'S 
POLICIES AIM 
TO BOOST 
TECH-BASED 
INDUSTRIES 
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May 2016 

Taiwan’s government 
announces its ‘Five 
plus two’ policy — a 
plan to innovate in the 
fields of biotechnology, 
defence, green energy, 
intelligent machinery 
and the Internet of 
Things. 


16 January 2020 


November 2016 
Launch of the Digital 
Nation and Innovative 
Economic Development 
Program (DIGI+), an 
initiative to make Taiwan 
a smart digital region by 
2025. Policies include 
investment in start-up 
firms and development 
of the cybersecurity 
industry. 


February 2017 

The Smart Machinery 
Promotion Program 
is introduced. It aims 
to develop smart- 
machinery applications 
by combining 
manufacturing 
expertise with that 
from information 

and communications 
technologies. 
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July 2017 

The Ministry of Science 
and Technology 
(MOST) unveils plans to 
establish four research 
centres in artificial 
intelligence (Al). The 
initiative will cost 
US$33 million annually 
over five years. 


August 2017 

MOST announces a 
4-year, $132-million 
semiconductor 
programme to speed 
up the development of 
Al processor chips, and 
a 5-year, $517.5-million 
strategy to cultivate 

Al talent and research 
(2017 to 2021). 


SOURCE: MAINLAND AFFAIRS COUNCIL 


Mark Liu, chair of the TSMC, says the region’s 
shortage of qualified engineers and other pro- 
fessionals could hamper the industry’s devel- 
opment more than technological barriers. 

Taiwan’s government has acknowledged the 
severity of its talent deficit, andin 2018 passed 
legislation to help recruit and attract foreign 
professionals to work in Taiwan. Measures 
include the introduction of an employment 
gold card, atype of visa that entitles the holder 
to tax breaks, pension contributions and flex- 
ible residency options for family members. Of 
the 500 cards that were issued by the end of 
November 2019, just under one-fifth went to 
specialists in science and technology. 

Su also points to the 2018 launch of an 
industry-funded organization called the 
Taiwan Al Academy, which started as one of the 
11 project teams at the AIMS centre. It now has 
four campuses across Taiwan and runs 12-week 
courses for technical professionals or busi- 
ness managers working in the field who want 
to sharpen their skills. “Smart manufacturing 
with Al applications is one focus for the train- 
ing curriculum,’ Su says. “Many large manufac- 
turing conglomerates such as AUO, MediaTek 
and the Formosa Plastics Group have sent their 


engineers and managers there.” (The academy 
lists these firms as sponsors on its website.) 
Other initiatives will also spur the need 
for talent. In January 2019, the government 
announced a programme to encourage Tai- 
wanese businesses located overseas to relocate 
home by offering incentives such as financial 
subsidies and administrative support. As 
this article went to press, 168 companies had 
pledgedto return from mainland China. William 
Tang, aspokesperson from InvesTaiwan — the 
government agency in Taipei that runs the 
project — says those companies will together 
invest $200 billion over three years in industrial 
infrastructure, such as factories, equipment 
and telecommunications technology, with an 
estimated $6.3 billion spent in 2019. 


From the ground up 
Mechavision in Taipei City, aspin-off company 
from ITRI that specializes in industrial robotic 
arms, is one of those hoping to capitalize ona 
growing need for more advanced production 
equipment both at home and abroad. 

The three-year-old firm makes robotic tactile 
sensing technology for use where humans and 
robots collaborate in close quarters on the 


MOVING MONEY 


Taiwan has increased its investment in mainland China over the past three decades. 


Taiwanese investment in 
mainland China (US$, billion) 
foe} 


factory floor. For safety, the robots are fitted 
with an electronic skin that has tactile sensors, 
which allows the machines to detect workers 
nearby and so move away when necessary. 

Mechavision’s chief strategy officer and 
founder, Camus Su, says that one of the 
firm’s main clients is Taiwanese company 
Qisda in Taoyuan City. It makes a variety of 
products, including computer monitors and 
mobile communication devices, so has opted 
for a mixed-line system that allows different 
models to be made on the same assembly 
line. The company needs “high flexibility in 
order to cope with different product sizes and 
specifications,” he says, “and right now, our 
human-robot collaboration system is the best 
way to do this.” 

Min Sun, chief Al scientist at Appier, a 
Taiwanese company that uses Alto help enter- 
prises solve business problems, says that in 
one respect, advanced computing technology 
has helped to transform the region’s industrial 
model as computers get better at performing 
complex manufacturing tasks. 

Unlike in the United States or mainland 
China, where one spoken language is dom- 
inant, Taiwan and its neighbours in south- 
east Asia operate in hundreds of languages, 
which makes it harder to collect surveys and 
feedback in a large, usable data set for devel- 
oping new software and hardware, says Sun, 
who was previously a researcher in electrical 
engineering at the NTHU. “In recent years, it’s 
madeit harder to gather the market knowledge 
needed to create new products, such as con- 
sumer electronics like tablets or headphones,” 
he says. “Now, you don’t need so much human 
resource to convert the information into usa- 
ble data. Al can understand all the different 
languages at once.” 


Sarah O'Meara is a writer and editor in London. 
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out its three-year Al 
Taiwan Action Plan to 
establish the island as 
an Al leader. It aims to 
prioritize Al innovation 
and implementation in 
industry during 2018-21 
and invest more than 
$300 million into the 
field. 


regulations for the 
employment of 
foreign professionals 
to encourage talent 
recruitment. 


promotes policies to 
incentivize investment 
in start-ups. These aim 
to provide capital, relax 
visa and employment 
laws, encourage local 
and foreign cooperation 
and help investors to 
cash in on successes. 


Taiwan Tech Arena to 
support start-ups in Al, 
semiconductors and 
software development 
and to help them to 
forge international links. 
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scheme to relocate 
Taiwanese businesses 
back to the region. 
Policies involve 
assisting firms with 
resources such as 
land, water, electricity, 
labour, taxation and 
financing. 
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approves a 4-year, 
$658-million spending 
plan for 5G technology 
to increase the region’s 
digital competitiveness. 
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